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FROM THE AUTHOR’S PREFACE TO 
THE FIRST GERMAN EDITION 


T HE importance of the standpoint afforded by the theory 
of groups for the discovery of the general laws of 
quantum theory has of late become more and more 
apparent. Since I have for some years been deeply concerned 
with the theory of the representation of continuous groups, it 
has seemed to me appropriate and important to give an account 
of the knowledge won by mathematicians working in this field 
in a form suitable to the requirements of quantum physics. An 
additional impetus is to be found in the fact that, from the 
purely mathematical standpoint, it is no longer justifiable to 
draw such sharp distinctions between finite and continuous 
groups in discussing the theory of their representations as has 
been done in the existing texts on the subject. My desire to 
show how the concepts arising in the theory of groups find their 
application in physics by discussing certain of the more important 
examples has necessitated the inclusion of a short account of the 
foundations of quantum physics, for at the time the manuscript 
was written there existed no treatment of the subject to which 
I could refer the reader. In brief this book, if it fulfills its 
purpose, should enable the reader to learn the essentials of the 
theory of groups and of quantum mechanics as well as the rela- 
tionships existing between these two subjects ; the mathematical 
portions have been written with the physicist in mind, and vice 
versa. I have particularly emphasized the “ reciprocity ” be- 
tween the representations of the symmetric permutation group 
and those of the complete linear group ; this reciprocity has as 
yet been unduly neglected in the physical literature, in spite of 
the fact that it follows most naturally from the conceptual 
structure of quantum mechanics. 
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There exists, in my opinion, a plainly discernible parallels 
between the more recent developments of mathematics an 
physics. Occidental mathematics has in past centuries broke 
away from the Greek view and followed a course which seerr 
to have originated in India and which has been transmittec 
with additions, to us by the Arabs ; in it the concept of numbe 
appears as logically prior to the concepts of geometry. Th 
result of this has been that we have applied this systematicall; 
developed number concept to all branches, irrespective of whethe 
it is most appropriate for these particular applications. Bu 
the present trend in mathematics is clearly in the direction of j 
return to the Greek standpoint ; we now look upon each brand 
of mathematics as determining its own characteristic domair 
of quantities. The algebraist of the present day considers th* 
continuum of real or complex numbers as merely one “ field 
among many ; the recent axiomatic foundation of projective 
geometry may be considered as the geometric counterpart of 
this view. This newer mathematics, including the modem 
theory of groups and “ abstract algebra,” is clearly motivated 
by a spirit different from that of “ classical mathematics,” which 
found its highest expression in the theory of functions of a 
complex variable. The continuum of real numbers has retained 
its ancient prerogative in physics for the expression of physical 
measurements, but it can justly be maintained that the essence 
of the new Heisenberg-Schrodinger- Dirac quantum mechanics is 
to be found in the fact that there is associated with each physical 
system a set of quantities, constituting a non-commutative 
algebra in the technical mathematical sense, the elements of 
which are the physical quantities themselves. 

Zurich, August, ig 28 



AUTHOR’S PREFACE TO 
THE SECOND GERMAN EDITION 


D URING the academic year 1928-29 I held a professorship 
in mathematical physics in Princeton University. The 
lectures which I gave there and in other American insti- 
tutions afforded me a much desired opportunity to present anew, 
and from an improved pedagogical standpoint, the connection 
between groups and quanta. The experience thus obtained has 
found its expression in this new edition, in which the subject 
has been treated from a more thoroughly elementary standpoint. 
Transcendental methods, which are in group theory based on 
the calculus of group characteristics , have the advantage of 
offering a rapid view of the subject as a whole, but true under- 
standing of the relationships is to be obtained only by following 
an explicit elementary development. 1 may mention in this 
connection the derivation of the Clebsch-Gordan series, which is 
of fundamental importance for the whole of spectroscopy and 
for the applications of quantum theory to chemistry, the section 
on the Jordan-Holder theorem and its analogues, and above all 
the careful investigation of the connection between the algebra 
of symmetric transformations and the symmetric permutation 
group. The reciprocity laws expressing this connection, which 
were proved by transcendental methods in the first edition, as well 
as the group-theoretic problem arising from the existence of spin 
have also been treated from the elementary standpoint. Indeed, 
the whole of Chapter V — which was, in the opinion of many 
readers, much too condensed and more difficult to understand 
than the rest of the book — has been entirely re-written. The 
algebraic standpoint has been emphasized, in harmony with the 
recent development of 44 abstract algebra,” which has proved so 
useful in simplifying and unifying general concepts. It seemed 


IX 
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impossible to avoid presenting the principal part of the theory 
of representations twice ; first in Chapter III, where the repre- 
sentations are taken as given and their properties examined, 
and again in Chapter V, where the method of constructing the 
representations of a given group and of deducing their properties 
is developed. But I believe the reader will find this two-fold 
treatment an advantage rather than a hindrance. 

To come to the changes in the more physical portions, in 
Chapter IV the role of the group of virtual rotations of space 
is more clearly presented. But above all several sections have 
been added which deal with the energy-momentum theorem of 
quantum physics and with the quantization of the wave equation 
in accordance with the recent work of Heisenberg and Pauli. 
This extension already leads so far away from the fundamental 
purpose of the book that I felt forced to omit the formulation 
of the quantum laws in accordance with the general theory of 
relativity, as developed by V. Fock and myself, in spite of its 
desirability for the deduction of the energy-momentum tensor. 
The fundamental problem of the proton and the electron has 
been discussed in its relation to the symmetry properties of the 
quantum laws with respect to the interchange of right and left, 
past and future, and positive and negative electricity. At 
present no solution of the problem seems in sight ; I fear that 
the clouds hanging over this part of the subject will roll together 
to form a new crisis in quantum physics. I have intentionally 
presented the more difficult portions of these problems of spin 
and second quantization in considerable detail, as they have 
been for the most part either entirely ignored or but hastily 
indicated in the large number of texts which have now appeared 
on quantum mechanics. 

It has been rumoured that the “ group pest ” is gradually 
being cut out of quantum physics. This is certainly not true 
in so far as the rotation and Lorentz groups are concerned ; 
as for the permutation group, it does indeed seem possible to 
avoid it with the aid of the Pauli exclusion principle. Never- 
theless the theory must retain the representations of the per- 
mutation group as a natural tool in obtaining an understanding 
of the relationships due to the introduction of spin, so long as 
its specific dynamic effect is-neglected. I have here followed the 
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trend of the times, as far as justifiable, in presenting the group- 
theoretic portions in as elementary a form as possible. The 
calculations of perturbation theory are widely separated from 
these general considerations ; I have therefore restricted myself 
to indicating the method of attack without either going into 
details or mentioning the many applications which have been 
based on the ingenious papers of Hartree , Slater , Dirac and 
others. 

The constants c and h } the velocity of light and the quantum 
of action, have caused some trouble. The insight into the 
significance of these constants, obtained by the theory of rela- 
tivity on the one hand and quantum theory on the other, is 
most forcibly expressed by the fact that they do not occur in 
the laws of Nature in a thoroughly systematic development of 
these theories. But physicists prefer to retain the usual e.g.s. 
units — principally because they are of the order of magnitude of 
the physical quantities with which we deal in everyday life.. 
Only a wavering compromise is possible between these practical 
considerations and the ideal of the systematic theorist ; I 
initially adopt, with some regret, the current physical usage, 
but in the course of Chapter IV the theorist gains the upper 
hand. 

An attempt has been made to increase the clarity of the 
exposition by numbering the formulae in accordance with the 
sections to which they belong, by emphasizing the more im- 
portant concepts by the use of boldface type on introducing 
them, and by lists of operational symbols and of letters having 
a fixed significance. 

H. WEYL. 


Gottingen, November, 1930 




TRANSLATOR’S PREFACE 


T HIS translation was first planned, and in part completed, 
during the academic year 1928-29, when the translator 
was acting as assistant to Professor Weyl in Princeton. 
Unforeseen delays prevented the completion of the manuscript 
at that time, and as Professor Weyl decided shortly afterward 
to undertake the revision outlined in the preface above it seemed 
desirable to follow the revised edition. In the preparation of 
this manuscript the German has been followed as closely as 
possible, in the conviction that any alterations would but de- 
tract from the elegant and logical treatment which characterizes 
Professor Weyl’s works. While an attempt has been made 
to follow the more usual English terminology in general, this 
programme is limited by the fact that the fusion of branches of 
knowledge which have in the past been so widely separated as 
the theory of groups and quantum theory can be accomplished 
only by adapting the existing terminology of each to that of 
the other ; a minor difficulty of a similar nature is to be found 
in the fact that the development of “ fields ” and “ algebras ” 
in Chapter V is accomplished in a manner which makes it appear 
desirable to deviate from the accepted English terminology. 

It is a pleasure to express my indebtedness to Professor Weyl 
for general encouragement and assistance, to Professor R. E. 
Winger of Union College for the assistance he* has rendered in 
correcting proof and in preparing the index, and to the publishers 
for their cooperation in adhering as closely as possible to the 


original typography. 


H. P. ROBERTSON 


Princeton, September, ig3i 
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INTRODUCTION 


T HE quantum theory of atomic processes was proposed by 
Niels Bohr in the year 1913, and was based on the 
atomic model proposed earlier by Rutherford. The 
deduction of the Balmer series for the line spectrum of hydrogen 
and of the Rydberg number from universal atomic constants 
constituted its first convincing confirmation. This theory gave 
us the key to the understanding of the regularities observed in 
optical and X-ray spectra, and led to a deeper insight into the 
structure of the periodic system of chemical elements. The issue 
of N aturwissenschaften , dedicated to Bohr and entitled “ Die 
ersten zehn Jahre der Theorie von Niels Bohr liber den Bau 
der Atome ” (Vol. 11 , p. 535 (1923)), gives a short account of the 
successes of the theory at its peak. But about this time it began 
to become more and more apparent that the Bohr theory was 
a compromise between the old “classical” physics and a new 
quantum physics which has been in the process of development 
since Planck’s introduction of energy quanta in 1900. Bohr 
described the situation in an address on “Atomic Theory and 
Mechanics ” (appearing in Nature, 116 , p. 845 (1925)) in the 
words : “ From these results it seems to follow that, in the 
general problem of the quantum theory, one is faced not with 
a modification of the mechanical and electrodynamical theories 
describable in terms of the usual physical concepts, but with 
an essential failure of the pictures in space and time on which 
the description of natural phenomena has hitherto been based.” 
The rupture which led to a new stage of the theory was made 
by Heisenberg, who replaced Bohr’s negative prophecy by a 
positive guiding principle. 

The foundations of the new quantum physics, or at least 
its more important theoretical aspects, are to be treated in this 

xix 
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book. For supplementary references on the physical side 
which are urgently required, I name above all the fourth edition 
of Sommerfeld’s well-known “Atombau und Spektr allini en ” 
(Braunschweig, 1924), or the English translation “Atomic 
Structure and Spectral Lines” (London, 1923) of the third 
edition, together with the recent (1929) “ Wellenmechanischer 
Erganzungsband ” or its English translation “ Wave Mechanics ” 
(1930). An equivalent original English book is that of Ruark 
and Urey, “ Atoms, Molecules and Quanta ” (New York, 1930), 
which appears in the “ International Series in Physics,” edited 
by Richtmeyer. I should also recommend Gerlach’s short 
but valuable survey “ Experimentelle Grundlagen der Quanten- 
theorie" (Braunschweig, 1921). The spectroscopic data, pre- 
sented in accordance with the new quantum theory, together 
with complete references to the literature, are given in the 
following three volumes of the series “Struktur der Materie,” 
edited by Born and Franck: — 

F. Hund, “ Linienspektren und periodisches System der 
Elemente” (1927); 

E. Back and A. Lande, “ Zeemaneffekt und Multiplett- 
struktur der Spektrallinien” (1925); 

W. Grotrian, “ Graphische Darstellung der Spektren von 
Atomen und Ionen mit ein, zwei und drei Valenzelektronen ” 
(1928). 

The spectroscopic aspects of the subject are also discussed 
in Paulino and Goudsmit’s recent “The Structure of Line 
Spectra ” (1930), which also appears in the “International 
Series in Physics.” 

The development of quantum theory has only been made 
possible by the enormous refinement of experimental technique , 
which has given us an almost direct insight into atomic 
processes. If in the following little is said concerning the 
experimental facts, it should not be attributed to the mathe- 
matical haughtiness of the author; to report on these things 
lies outside his field. Allow me to express now, once and for 
all, my deep respect for the work of the experimenter and for 
his fight to wring significant facts from an inflexible Nature, 
who says so distinctly “ No and so indistinctly “ Yes ” to 
our theories. 
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xxi 


Our generation is witness to a development of physical 
knowledge such as has not been seen since the days of Kepler, 
Galileo and Newton, and mathematics has scarcely ever 
experienced such a stormy epoch. Mathematical thought 
removes the spirit from its worMly haunts to solitude and 
renounces the unveiling of the secrets of Nature. But as 
recompense, mathematics is less bound to the course of worldly 
events than physics. While the quantum theory can be traced 
back only as far as 1900, the origin of the theory of groups 
is- lost in a past scarcely accessible to history ; the earliest 
works of art show that the symmetry groups of plane figures 
were even then already known, although the theory of these 
was only given definite form in the latter part of the eighteenth 
and in the nineteenth centuries. F. Klein considered the 
group concept as most characteristic of nineteenth century 
mathematics. Until the present, its most important application 
to natural science lay in the description of the symmetry of 
crystals, but it has recently been recognized that group theory 
is of fundamental importance for quantum physics ; it here 
reveals the essential features which are not contingent on a 
special form of the dynamical laws nor on special assumptions 
concerning the forces involved. We may well expect that it is 
just this part of quantum physics which is most certain of a 
lasting place. Two groups, the group of rotations in y dimen- 
sional space and the permutation group , play here the principal 
role, for the laws governing the possible electronic configurations 
grouped about the stationary nucleus of an atom or an ion are 
spherically symmetric with respect to the nucleus, and since the 
various electrons of which the atom or ion is composed are 
identical, these possible configurations are invariant under a 
permutation of the individual electrons. The investigation of 
groups first becomes a connected and complete theory in the 
theory of the representation of groups by linear transformations , 
and it is exactly this mathematically most important part 
which is necessary for an adequate description of the quantum 
mechanical relations. All quantum numbers , with the exception 
of the so-called principal quantum number , are indices character- 
izing representations of groups. 
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This book, which is to set forth the connection between groups 
and quanta , consists of five chapters. The first of these is 
concerned with unitary geometry. It is somewhat distressing 
that the theory of linear algebras must again and again be 
developed from the beginning, for the fundamental concepts 
of this branch of mathematics crop up everywhere in mathe- 
matics and physics, and * a knowledge of them should be as 
widely disseminated as the elements of differential calculus. 
In this chapter many details will be introduced with an eye 
to future use in the applications ; it is to be hoped that in 
spite of this the simple thread of the argument has remained 
plainly visible. Chapter II is devoted to preparation on the 
physical side; only that has been given which seemed to me 
indispensable for an understanding of the meaning and methods 
of quantum theory. A multitude of physical phenomena, which 
have already been dealt with by quantum theory, have been 
omitted. Chapter III develops the elementary portions of the 
theory of representations of groups and Chapter IV applies them 
to quantum physics. Thus mathematics and physics alternate 
in the first four chapters, but in Chapter V the two are fused 
together, showing how completely the mathematical theory is 
adapted to the requirements of quantum physics. In this last 
chapter the permutation group and its representations , together 
with the groups of linear transformations in an affine or unitary 
space of an arbitary number of dimensions, will be subjected to 
a thorough going study. 
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CHAPTER I 


UNITARY GEOMETRY 

§1. The /i-dimensional Vector Space 

T HE mathematical field of operation of quantum mechanics, 
as well as of the theory of the representations of groups, 
is the multi-dimensional affine or unitary space. The 
axiomatic method of developing the geometry of such a Space 
is no doubt the most appropriate, but for the sake of clearness 
I shall at first proceed along purely algebraic lines. I begin 
with the explanation that a vector g in the w-dimensional 
linear spaced = 9t n is a set of n ordered numbers (x Xi x 2 * • *,x n ); 
vector analysis is the calculus of such ordered sets. The two 
fundamental operations of the vector calculus are the multiplica- 
tion of a vector g by a number a and the addition of tzvo vectors g 
and t). On introducing the notation 

£ = (#l> x 2i * * *> X n)) ^ (Yh y * * *) Vn) 

these operations are defined by the equations 

:sss (ax x , ax 2 , * * *, aXf^jy g -j- t) === ( x-y -j- y^ } x% T* y 2 , * * *, 

x n + Vn)- 

The fundamental rules governing these operations of multiplica- 
tion by a number and addition are given in the following table 
of axioms, in which small German letters denote arbitrary 
vectors and small Latin letters arbitrary numbers : 

(a) Addition . 

1. a + b = b + a ( commutative law). 

2. (a + b) + c = a + (6 + c) ( associative law). 

3. a and c being any tzvo vectors } there exists one and only one 
vector g for which a + g = c. It is called the difference c — a of 
c and a (possibility of subtraction). 
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(/?) Multiplication . 

1. (a + = (#j) + (ig) (j6r$f distributive law). 

2. = [ab ) J [associative law). 

3. lj = S. 

4. a(j + t|) = (aj) + (aty) [second distributive law). 

The existence of a vector 0 = (0, 0, • • •, 0) with the property 
S+0=0+s = % 

need not be postulated separately as it follows from the axioms. 

Affine vector geometry concerns itself entirely with concepts 
which are defined in terms of the two fundamental operations 
with which the axioms (a) and (j8) are concerned ; we mention 
a few of the most important. A number of vectors a Xl o 2 , • • •, a h 
are said to be linearly independent if there exists between them 
no homogeneous linear relation 

c x a x + c 2 a 2 + ■ • • + c h a h = 0 

except the trivial one with coefficients 

c i — 0, c 2 — 0, • • •, c h = 0. 

h such vectors are said to span an h-dimensional ( linear ) sub- 
space SR' consisting of all vectors of the form 

l = + £2^2 + • • • + 1*1) 

where the f*s are arbitrary numbers. It follows from the 
fundamental theorem on homogeneous linear equations that 
there exists a non-trivial homogeneous relation between any 
h + I vectors of SR'. The dimensionality h of SR 7 can therefore 
be characterized independently of the basis : every h + 1 vectors 
in SR' are linearly dependent, but there exist in it h linearly 
independent vectors. Any such system of h independent 
vectors a lf a 2 , • • •, a* in 91' can be used as a co-ordinate system 
or basis in SR' ; the coefficients f 1} £ 2 , •••,£* in the representation 
(1.1) are then said to be the components of g in the co-ordinate 
system (a* a* • • •, d h \. 

The entire space SR is w-dimensional, and the vectors 

e i = (1, 0, 0, • • •, 0), 

e 2 = (0, 1, 0, • • •, 0), 

ej— (0* 0/0/- \ 1) 

define a co-ordinate system in it in which the components of a 
vector 



l = K • • •, *n) 



THE w-D IMENS IONAL VECTOR SPACE 


3 


agree with the “ absolute components ” x { : 

l = + • • • + *»e n . 

From the standpoint of affine geometry, however, the “ absolute 
co-ordinate system ” (1.2) has no*preference over any other which 
consists of n independent vectors of SR. We now add to the 
previous axioms, which did not concern themselves with the 
dimensionality n, the following dimensionality axiom : 

(y) The maximum number of linearly independent vectors in SR 
is n. 

These axioms (a), (/J), and (y) suffice for a complete formula- 
tion of vector calculus, for if e h e 2 , • • •, e n are any n independent 
vectors and £ is any other vector there must necessarily exist 
a linear dependence 

ajC + + ^ 2 e 2 + * # * + &tfin = 0 

between them. Since not all the coefficients may vanish we 
must in particular have a 4= 0, and consequently any vector £ 
can be expressed as a linear combination 

S = %lll + #2^2 4* • • * + #n e n (1*3) 

of the “ fundamental vectors ” e lt e 2 , • • •, e n . We specify £ by 
the set (x lf x 2 , • • •, x n ) of components in this co-ordinate system. 
In accordance with axioms (a) and (jS) for addition and multi- 
plication we then have for any two vectors (1.3) and k) 

aj=(^ 1 )e 1 H b K)e„ S+t )=(x x +y 1 )t 1 -\- f (*„+;y„)e B) 

and we arrive at the definitions from which we started. The 
only — but important — difference between the arithmetic and 
the axiomatic treatment is that in the former the absolute co- 
ordinate system (1.2) is given the preference over any other, 
whereas in the latter treatment no such distinction is made. 
Given any system of vectors, all vectors £ which are obtained, 
as (1.1), by linear combinations of a finite number of vectors 
a 1} & 2 , • • •, a h of the system constitute a (linear) sub-space — the 
sub-space “ spanned ” by the vectors a. 

91 is said to be decomposed or reduced into two linear sub- 
spaces SR', SR" (SR = SR' + SR") if an arbitrary vector £ can be 
expressed uniquely as the sum of a vector £' of SR' and a vector 
£" of SR". A co-ordinate system in SR' and a co-ordinate system 
in SR" constitute together a co-ordinate system for the entire 
space SR; this co-ordinate system in SR is “adapted” to the 
decomposition SR' + SR". The sum n' + n" of the dimension- 
alities of SR' and SR" is equal to n, the dimensionality of SR. 
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Conversely, if the sub-spaces SR', SR" have no vector except 0 
in common, and if the sum of their dimensionalities is n then 

SR = SR' + SR". 

JR' being an M-dimensional sub-space, two vectors g and t) are 
said to be congruent modulo SR' : 

g s t) (mod. SR'), 

if their difference lies in SR'. Congruence satisfies the axioms 
postulated of any relation of equality : every vector is congruent 
to itself ; if £ = t) (mod. SR') then t) a g (mod. SR') ; if g = b 
(mod. SR') and ^ sj (mod. SR'), then 5 sj (mod. SR'). It is 
therefore permissible to consider vectors which are congruent 
mod. SR' as differing in no wise from one another ; by this ab- 
straction, which we call projection with respect to SR' the 
M-dimensional space SR gives rise to an ( n — n ') -dimensional 
space SR. SR is also a vector space, for from 

Ei s E2, tyi = (mod. SR') 
follow the relations 

agi = ag 2 , Ei + tyi s Ea 4* 9a (mod. SR'). 

The operations of multiplication by a number and addition can 
therefore be considered ones which operate directly on the 
vectors g of SR. All vectors g of SR which are congruent mod. SR' 
give rise to the same vector g of SR. If SR' is one-dimensional 
and is spanned by e the above process is the familiar one of 
parallel projection in the direction of e ; it is not necessary to 
give an (n — 1) -dimensional sub-space of SR on to which the 
projection is made. 

If a is a non-null vector, all vectors g which arise by multi- 
plying a by a number are said to lie on the same j ray as a. Two 
non-null vectors determine the same ray when, and only when 
one is a multiple of the other. In a given co-ordinate system 
the vector a is characterized by its components a,, a, • • • a 
whereas the ray a is characterized by their ratios a x : a t : ’• • • :& ” 
these, ratios have meaning only when the components of a do 
not all vanish, i.e. only when a 4= 0. 

The transition from one co-ordinate system e< to another tJ is 
accomplished by expressing the new co-ordinate vectors e/ in 
terms of the old : 
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If % u x- are the components of an arbitrary vector % in the old 
and in the new co-ordinate systems, respectively, then 

S = IX = ZXX', 

i k 

from which the law of transformation 

H 

Xi = £a ik x k ' ( 1 . 4 ) 

& = 1 

follows. The requirement that the co-ordinate vectors also 

be linearly independent is expressed arithmetically by the non- 
vanishing of the determinant of the coefficients a ik . The com- 
ponents of vectors j, t), - • • in 91 undergo the same transformation 
on transition to the new co-ordinate system e/ and are said to 
transform cogrediently . 

§2. Linear Correspondences. Matrix Calculus 

The formula (1.4) can, however, be otherwise interpreted ; 
it is the expression of a linear or affine correspondence or 
mapping of the space 91 on itself. But for this purpose it 
will be found more convenient to interchange the roles of the 
accented and the unaccented co-ordinates. On employing a 
definite co-ordinate system e iy the equation 

Xi = 2X* x k (2.1) 

Jfc- 1 

associates with an arbitrary vector £ with components x € a vector 
j' with components x{ , This correspondence A : $ -» j' of 91 on 
itself can be characterized as linear by the two assertions : if 
$, t) go over into £', t)', then goes over into ag and j -J- t) into 
j' + ty'. Linear correspondences therefore leave all affine rela- 
tions unaltered ; hence their prominence in the theory of affine 
geometry. In order to show that these two conditions fully 
determine the linear correspondence (2.1), consider the following : 
if a correspondence A which satisfies these conditions sends the 
fundamental vector e* over into 

= ( 2 - 2 ) 
i 

then, in consequence of the above requirements, 

j = x x t x + ■ • ■ + x n e n 

goes over into 

l’ — x x tj. 1 + • • • + x n e„'. 
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On substituting (2.2) in this equation we see that the new vector 
j' has in the co-ordinate system e< the components obtained 
from the components %i of j by means of (2.1). It has become 
customary in quantum physics to call the linear correspondences 
of a vector space 91 operators which operate on the arbitrary 
vector J of 8t. 

Let A, B be two linear correspondences, the first of which 
sends the ’arbitrary vector % over into j' = Ag, while the second 
sends j' into %" = Bg' = B(A%). The resultant correspondence 
C, which carries £ directly into £", is also linear and is denoted 
by (BA) (to be read from right to left !) : 

(BA)% = B(A]c). 

This “ multiplication ” satisfies laws which are similar to those 
of multiplication of ordinary numbers ; in particular, the as- 
sociative law 

C(BA) = ( CB)A 

is here valid, but the commutative law is not — in general 
AB =|= BA. The “ 1 ” in this domain, which we here denote by 
1, is the identity, i.e. that correspondence which associates every 
vector £ with itself :£->£. Hence 

A1 = 1A = A. 

The correspondence A is then and only then reversible in case 
it is non-degenerate, i.e. if it carries no non-vanishing vector into 
the vector 0, or if distinct vectors are always carried over into 
distinct ones. The algebraic condition for this is the non- 
vanishing of the determinant |a a | = det A ; there then exists 
the inverse correspondence A~ l : 

AA~ l = A~ l A = 1 . 

The multiplication theorem for determinants states that 
det (BA) = det B • det A. 

Not only can we “ multiply ” two correspondences, we can 
also “ add ’’ them. This concept of addition arises quite natur- 
ally : if the arbitrary vector £ is sent over into j/ by A and into 
j 2 ' by B, then that correspondence which sends J into + £ 2 ' is 
also linear and is denoted by A + B : 

(A + B)i = Ai + B%. 

We may also introduce multiplication by an arbitrary number 
a: aA is that correspondence which sends £ into a(A%). Addition 
and multiplication by a number obey the same laws as the 
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analogous operations on vectors. Addition is commutative, 
and has as its inverse subtraction. The role of 0 is played by 
the correspondence 0 which transforms every vector £ into the 
vector 0. Addition obeys the distributive law with respect to 
multiplication : 

(A + B)C = AC + BC, C(A + B) = CA + CB, 

(« aA)C = a(AQ, C(aA) = a(CA). 

Before proceeding to the arithmetical expression of these 
operations in a given co-ordinate system, we consider another 
natural generalization. We can map an wz-dimensional vector 
space SR linearly on an n-dimensional space © ; this is accom- 
plished when with each vector £ of SR a vector 1) of © is associated 
in such a way £ -> fc) that from £ x -> £ 2 t) 2 it follows that 

a h dh, Ei + E* 

Such a correspondence A : £ t) is expressed by equations of 
the form 

tn 

Vk = IX* *< (A = 1, 2, • • •, n) (2.3) 

where x lf • • % are the components of £ in a given co-ordinate 
system in the space Sft and y lf • • •, y n have the corresponding 
interpretation in ©. With this correspondence A there is 
associated the matrix 


a il a l2 • • 


&21 #22 • • 

• #2m 

#nl #n2 • • 

■ • 


with n rows and m columns, and which we also denote by 
the same letter A . The first index indicates the row and the 
second the column to which a ki belongs. We can also add corre- 
spondences of the same space SR on the same space ©. Addition 
and multiplication by a number is accomplished on matrices by 
subjecting their n • m components to these operations : if 

A = || a ki || and B = || b ki || 

then 

aA = || a * a ki ||, A + B = || a ki + b ki ||. 

If we have a third (^-dimensional) vector space % r the consec- 
utive application of the correspondences A : £ -> ty of SR on © and 
B : t) $ of © on 2 gives rise to the correspondence C = BA : £ j 
of SR on S. This composition is expressed in terms of matrix 
components by the law 
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B has p rows and n columns and A n rows and m columns ; the 
composition of matrices is possible when the first factor B has 
the same number of columns as the second factor A has rows. 
The component or element c Ul which is found at the intersection 
of the F 1 row and the i th column, is formed in accordance with 
(2.4) from the components in the F* row of B and the column 
of A. An important special case is that in which % is the same 
space as SR ; A is then a correspondence of ?R on ©, B of (3 on SR. 
Already here concepts of the theory of groups play an important 



r61e ; on beginning Chapter III, which deals with the theory of 
groups, the reader should return to the matter here discussed 
as an illustration. 

The matrix calculus allows us to express the formulae for 
a linear correspondence, such as (2.3), in an abbreviated form. 
We do this by denoting by x that matrix whose only column 
consists of the vector components x h x 2t • • •, x m ; similarly 
for y. In accordance with the rule (2.4) for the composition of 
matrices, equations (2.3) can be written 

y = Ax. 


(2.5) 
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This form is particularly useful in examining the effect on the 
matrix A of a linear correspondence of a space S on a space © 
when the original co-ordinate systems are replaced by new ones. 
If this change of co-ordinates is effected by the transformations 

%t = JJSij or x = Sx' in 9i, 
i 

7k = Stkh 7h or y = Ty' in ©, 

h 

then from (2.5) 

Ty' = ASx' or y' = {T~ l AS)x'. 



The same correspondence in the new co-ordinates is therefore 
expressed by the matrix 

A' - r-MS. (2.6) 

Let us now return to the linear correspondence A of a space 
9t on to itself* If 9T is a linear n'-dimensional sub-space of 9T 
we say that A leaves 9f' invariant if it carries any vector of 91', 
over into a vector of 9t'. If the co-ordinate system is so chosen 
that the first n' fundamental vectors lie in 91', the matrix of 
a correspondence which leaves 91' invariant will assume the 
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form given by Fig. I. All elements in the rectangle of n' columns 
and n — v! rows denoted by zeros in Fig. I, vanish. A contains 
a correspondence of 91' on to itself and at the same time a corre- 
spondence of the space 9t, arising by projecting JR with respect 
to SR', on to itself. The matrices of these correspondences con- 
sist in the shaded squares. If JR is decomposed into 9f? x -f 
(«j + « 2 = «), and if the correspondence A leaves both sub- 
spaces 9?j and JR* invariant, then A is completely reduced 
into a correspondence of SR X on itself and a correspondence of 
JR S on to itself. If the co-ordinate system is adapted to the 
decomposition JR t + JR*, the matrix A is completely reduced into 
two square matrices arranged along the principal diagonal as 
in Fig. 2. The unshaded rectangles are empty — the elements 
situated in these portions are all zero. 

Let the n-dimensional linear space JR be decomposed into 
sub-spaces JRi + JR 2 + • • •, JR« having the dimensionality n« ; n is 
then equal to the sum + «*+•• •• Any vector j can then be 
written uniquely as the sum of components + £* + * * * which 
lie in the sub-spaces JR X , JR 2 , • • -. The association j j a is 
a linear correspondence E a of JR on to JR«. Given a correspond- 
ence A : £ £' of JR on to itself, we consider that linear corre- 

spondence [A]*f, which carries an arbitrary' vector £ of JR B over 
into the component £«' in JR« of £'. We call [A] aft the portion of 
A in which SR« intersects JR B . This terminology arises from the 
matrix representation of A ; on adapting the co-ordinate system 
to the decomposition JR X + JR a + • • • the set of variables x t , or 
rather their indices i which number the rows and columns of 
the matrix, is broken up into segments of lengths n« (« = 1, 2, - • 

The matrix A is thereby divided into the single rectangles 
[A] afi in which the a a set of rows intersects the £ th set of columns, 
and which consist of w* • elements. 

If A is the matrix of a correspondence of JR on to itself in 
a given co-ordinate system, and A' its matrix in a co-ordinate 
system obtained from the first by means of the reversible 
transformation S, then in accordance with (2.6) 


A' = S~ l AS. (2.7) 

The search for an invariantive characterization of correspondences 
may be formulated algebraically : to find expressions which 
are so formed from the components of an arbitrary matrix that 
they assume the same value for equivalent matrices, i.e. for 
matrices A, A! between which a relation (2.7) exists. The way 
in which this can be accomplished is indicated by the related 
problem of finding a vector j =j= 0 which is transformed into 
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a multiple Aj of itself under the influence of A. The column x 
of the components of £ must then satisfy the equation 

Xx = Ax } or (A1 — A)x = 0. 

But n linear homogeneous equations in n unknowns have a 
non-vanishing solution only if their determinant vanishes ; the 
multiplier A is therefore necessarily a root of the “ characteristic 
polynomial ” 

/(A) = det (A1 - A) (2.8) 

of A . This polynomial is an invariant in the above sense, for 
from (2.7) or SA' = AS it follows that 

S( A1 - A') = (A1 - A)S, 

whence by the theorem concerning the multiplication of deter- 
minants 

det 5 • det (A1 — A') = det (A1 — A) • det 5. 

Since the determinant of the reversible transformation 5 cannot 
vanish, we can divide by it and obtain the required identity 

\XL-A'\ ■ | A1 — 1. 

The characteristic polynomial is of degree n in A : 

/(A) = A* — s x A"- 1 + • • • ± s B 

whose coefficients, certain integral functions of the elements 
a ik} are invariants of the correspondence A. The “ norm ” s n 
is merely the determinant of A . The first coefficient s h the 

trace 

*1 = ^11 + #22 + ’ • * + #nn = t tA (2.9) 

is of more importance, as it depends linearly on the a ik : 
tr(A l + A 2 ) = tr A x + tr A* 

If A is a linear correspondence of the w-dimensional vector 
space SR on the n-dimensional space ©, and B is conversely a 
linear correspondence of © on SR, then we can build the corre- 
spondences BA of SR on to itself and AB of © on to itself. These 
two correspondences have the same trace 

tr (BA) = tr (AB) (2.10) 

for, in accordance with the rule of composition (2.4) and the 
definition (2.9) we have 

tr (BA) = JBb ik a ki , tr (AB) = £a ki bi k 

i,k i f k 
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where i runs from 1 to m and k from 1 to n . The special 
in which A and B are both correspondences of SR on to i 
naturally deserves particular consideration. 

§ 3. The Dual Vector Space 

A function X(j) of the arbitrary vector £ of the form 

ccyX x + a^ 2 + ‘ ‘ ( 

is called a linear form. This concept is invariant in the sens 
affine geometry : it can be defined by means of the functic 
properties 

L(*t> = a • L( j), L(l + t>) = m + m. 

It is obvious that the expression (3.1) has these properties, 
conversely, on introducing a co-ordinate system e t and seti 
j = Zxti, it follows that 

m = £*i L{t t ) = EctiXi ; a-i — L(e { ). 

» t | 

On going over to another co-ordinate system such that 
components x t of an arbitrary vector £ undergo the transfori 
tion (1.4), the linear form becomes 

Z*iXi = Zo t/xf 

the coefficients a/ of which are related to the original a* by 
equations 

a*' = Z&oc' <**• 

i 

The coefficients ol { of a linear form are said to transform contx 
grediently to the variables 

It is, however, not necessary to consider the oc* as consta 
and the x { as variables. When the oc,* do not all vanish the eqi 
tion L(£) = 0 defines a “ plane,” i.e. an (n — l)-dimensio: 
sub-space ; a vector £ lies in the plane if its components sati 
this equation. But on the other hand we can ask for the equati 
of all planes which pass through a given non- vanishing vector ; 
the %i = Xi° are then constants and the a< variables. It is the 
fore most appropriate to consider the two sets (x 1} x Zj • • *, x 
(<*i, <* 2 , # * 'i «n) in parallel. 

We therefore introduce in addition to the space SR a seco 
^-dimensional vector space, the dual space P. From the co, 
ponents (&, * * *, £n) of a vector f of P and a vect 

(x lf x zi " ’ % %n) of SR we can construct the inner or scalar prod\ 

it x \ + + * * * + £n*n (3 
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This product has, by definition, an invariantive significance, for 
when SI is referred to a new co-ordinate system by means of 
a transformation of the x { the variables & of the dual space P 
undergo the contragredient transformation. This dual space is 
in fact introduced in order to enable us to associate a contra- 
gredient transformation with each one-to-one transformation. 
To repeat, two linear reversible transformations 

x = Ax\ £ = Af (3.3) 

are contragredient with respect to each other if they leave (3.2) 
unaltered : 

£l#l+£2#2+* * * + £n X n = ilXi+ £ 2 X 2 + * * * + £«'#/• (3.4) 

A vector $ of SI and a vector £ of P are said to be in involution 
when their product (3.2) vanishes. A ray in St determines a 
plane in P, i.e. the plane consisting of the vectors which are in 
involution with the given ray, and conversely. Duality is 
a reciprocal relationship.*)* 

The dual or transposed matrix A* of a matrix A = \\a ki \\ 
is obtained by interchanging the rows and columns of A. 
A* = 1 1 ^| | is therefore defined by a* k = a ki , and has m rows 
and n columns. We shall always employ the asterisk to in- 
dicate this process. And what is its geometrical interpretation ? 
Let SI be an w-dimensional, © an n- dimensional, vector space ; 
A : J h a linear correspondence of SI on @, specified in terms 
of given co-ordinate systems in SI and © by the matrix A : 

y k = S^ki *i, 

i 

and let P, E be the dual spaces. The product 
Hvkyk= ■>?**<( = £&%{), 

k k,i i 

where rj is an arbitrary vector of E with components Y) ki has then 
an invariantive significance. A bilinear form which depends 
linearly on a vector rj of E and a vector J of Si is therefore in- 
variantively associated with a linear correspondence of Si on @, 
and conversely. This gives rise, as the expression of the bi- 
linear form given in parentheses shows, to a correspondence 

*)-+£• £i = U^kiVk 
k 

of E on P, i.e. the dual A* of A. The reciprocal relation existing 
between the correspondence A and its dual A* may be expressed 

t In the theory of relativity it is usual to call vectors in 01 and P contra- 
variant and covariant vectors , respectively. 
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as follows : if j is an arbitrary vector in 5ft and rj is an arbitrary 
vector in 2, then the product of the vectors Ag and tj is equal 
to the product of £ and A*rj. The dual correspondences obey 
the linear laws 

(A + A)* = Aj* + Af, (aA)* = a-A \ 

If A is a correspondence of 91 on © and B a correspondence of 
© on %, then since 

(BA)* = A*B* (5.5) 

BA maps 9t linearly on % and A*B * maps the dual space T 
of % on the dual P of 5ft. 

We have agreed once and for all to consider the set 
x 1} %% * • *, % n of components of a vector £ as a column ; the 
inner product of the vector J in 5ft with the vector f in P can 
therefore be written in matrix notation as £*x or x*£. The 
transformations (3,3), from the first of which it follows that 
x* = x'*A*, are consequently contragredient to one another if 

A*A = 1 or A = (. A*)~\ (3.6) 

and we have arrived at an explicit expression for the contra- 
gredient transformation. 

Let 5ft' be an ^'-dimensional sub-space of 5R = 9t«. All 
vectors of P which are in involution with the totality of vectors 
of $ft' obviously constitute, in consequence of the simplest 
theorems on linear homogeneous equations, an ( n — /^-dimen- 
sional sub-space P' of P. And from this we are led immediately 
to the result that if a correspondence A of 'Si on itself leaves the 
sub-space 5ft' invariant , then the dual correspondence A* of P on 
itself leaves the associated sub-space P' invariant 

Let 5R be decomposed into two or more sub-spaces 
3ti + Sta + • • • °f dimensionalities n 1} n 2 , • • •, and let the. 
sub-space of P which consists of all vectors in involution with 
all vectors of 9t a + 5K 3 -f • * - be denoted by P l3 the dimension- 
ality of which is also n v Defining P s , P 3 analogously, we arrive 
at the decomposition P = Pj -f P 2 + • • *, for the sum of a 
vector of P 3 a vector of P 2 , etc., can only ' be, zero when each 
of the individual summands vanishes. In order to prove this 
latter statement, we note that if the sum is 0 then the first 
summand belongs to P a as well as to P 2 +■ P 3 + * • •, i.e. it is 
in involution with all the vectors of Sft 2 + 5ft s + • * • as well as 
with all those of 5ft 3 , and is therefore in involution with all the 
vectors of 5ft. But this is only possible if this first, and therefore 
any, summand is zero. P x can be considered as the space dual 
to 5fti, for if £ is an arbitrary vector in 5ft x and rj a vector in P 
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with components rjW in the various P*, then the product of 
% and rj is equal to the product of £ and 

If a correspondence A of 31 on itself leaves the n'-dimensional 
sub-space 31' invariant, then the (n — n')-dimensional sub-space 
P' is invariant under the dual correspondence A* of P on itself. 
If 9ft is decomposed into 9fti + 9ft 2 + * * * and if A leaves each 
of the sub-spaces 9ft* invariant, then A* leaves each of the sub- 
spaces P* invariant. If A is any correspondence in 9ft and [A] a p 
that portion in which 9ft* intersects 9ft / j, then the portion [*4*]^* 
of A * in which P^ intersects P* is dual to [A] afi : 

[A*h« = MV (3.7) 

[ 'A]*p maps 9ft^ on 9ft* and [A*]p 9 maps the dual space P* on P^. 

All these results are conceptually evident, but can be seen 
even more readily directly from the matrices on adapting the 
co-ordinate system to the decomposition 9ft x + 9ft 2 + * ■ \ 

§4. Unitary Geometry and Hermitian Forms 

The metric is introduced into affine geometry by means of 
a new fundamental concept : the absolute magnitude of a vector . 
In Euclidean geometry the sum of the squares 

f = Xl * + x 2 * + • • • + x n 2 (4.1) 

of the components of a vector J = (%, x 2 , • • •, x„) is taken as 
the square of its absolute value. The only co-ordinate systems 
which are then equally permissible are the Cartesian systems, 
in which the square of the absolute value of £ is given by (4.1) 
in terms of the components x { ; the range of values which the 
components may here assume is taken as the continuum of all 
real numbers. But the content of the preceding paragraphs 
is not bound to this choice ; the only requirement is, in fact, 
that the range of permissible values constitute a “ field ” in 
which the four fundamental operations (excluding division by 
zero) can be performed. We shall hereafter consider the con- 
tinuum of all complex numbers as the range of values which our 
components may assume. The expression (4.1) loses its definite 
character in this domain ; the sum of the squares can vanish 
without implying that each term is zero. It is therefore desirable 
to replace the quadratic form (4.1) by the “ unit Hermitian 
form ” 

* 1*1 + X 1 X 2 + ’ • ' + £ n x n ( 4 - 2 ) 

where x denotes the complex conjugate of a number %. The 
value £ a of (4.2) will be taken as the square of the absolute 
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magnitude of the vector £ = (* lf x 2) • • •, x n ) and the correspond- 
ing bilinear form 

(ft) = x&i + + • * • + *#n 

as the scalar product (jt)) of the two vectors £ and t) = 
(yi> V 2 * * * % Vn)- A co-ordinate system is said to be normal 
when the square of the absolute magnitude of a vector £ is 
expressed in terms of its components x { in this co-ordinate 
system by (4.2)* In a normal co-ordinate system e< these 
components are the scalar products 

%i = ( 4 * 3 ) 

The transformations which lead from one normal co-ordinate 
system to another such, which therefore leave the form (4.2) 
invariant, are called unitary transformations; f 

The conditions which characterize unitary transformations 
are entirely analogous to those for orthogonal transformations, 
with which we are familiar from the elements of analytic geo- 
metry. Let x = Sx* be such a transformation ; under the 
influence of 5 the fundamental metric form (4.2) goes over into 

x'*S*Sx'. S is therefore unitary if and only if S*S = 1 ; the 
fact that det 5* 4= 0 follows immediately from* this. Indeed, 
since a matrix S and its transposed S* have the same deter- 
minant, it follows that the determinant of a unitary transformation 
has the absolute value 1 : jdet S\ 2 = 1. These conditions may 

be expressed by the assertion that S* is the matrix S'" 1 reciprocal 

to 5, and therefore not only S*S = 1 but also £5* = 1. The 
first of these equations states that the sum of the squares of 
the absolute values of the elements of a column is 1 and that 
the sum of the mixed products SsriSnc of two different columns 

(i 4= k) is 0 ; the second equation contains the same assertion 
for the elements of the rows. 

We carry over the terminology usual in Euclidean geometry. 
In particular, the vector t) is said to be perpendicular to £ if 
the scalar product (#>) vanishes. In virtue of the symmetry law 

(%) = (Stj) 

perpendicularity is a reciprocal relationship. There exists no 
vector a, except (t = 0, to which all vectors are perpendicular ; 
in fact, a = 0 is the only vector which is perpendicular to itself. 
Normal co-ordinate systems can be characterized by the fact 

| The name " orthogonal ” has been used in the physical literature to 
denote these transformations, but in mathematics it is necessary to have 
different names for these two different concepts. 
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that for them the scalar products of the fundamental vectors 
e* among themselves are 

(e,- e„) = S ifc = | 0 ^ ^ 

On comparing the fundamental metric form (4.2) with (3.2) 
it is seen that the unitary space SR can be characterized by the 
fact that its conjugate complex 91 coincides with its dual P, or 

more precisely, that the conjugate complex £ of a vector £ can 
at the same time be considered as its dual. We found that with 
a correspondence A of an m-dimensional unitary space 91 on 
an w-dimensional © is associated in an invariant manner the 
correspondence A* of the dual space 2 on the dual P. As a 
consequence of the equation P = 91 for unitary spaces 

A* = A 

is a correspondence of © on 91; we call it the “ Hermitian 
conjugate of A." id is a correspondence of 9? on itself, 
A A of © on itself. A correspondence © which carries the 
general vector £ over into £' — 5£ is unitary if it leaves the 
absolute magnitude of £ unaltered : £' 2 = J 2 . Two configura- 
tions consisting of vectors, either of which can be obtained from 
the other by a unitary transformation, are congruent in unitary 
geometry ; i.e. unitary geometry is the theory of those relation- 
ships which are invariant under an arbitrary unitary transforma- 
tion. The characteristic property of such transformations is 
expressed in terms of the matrix calculus by either of the two 
equations 

55 = 1, 55 = 1. 

Let 91' be an w-dimensional linear sub-space spanned by 
the linearly independent vectors fli, * ‘ consider 

a vector £ as belonging to the sub-space 91" if and only if it is 
perpendicular to 91', i.e. to all the vectors of 91' ; such a vector 
must therefore satisfy the equations 

M = °. M) = °> ‘ = °- 

From these it follows that 91" is (» - w)-dimensional. The 
relation between 91' and 91" is a reciprocal one : every vector 
of 91" is perpendicular to every vector of 91 and conversely. 
We then have 91 = 91' + 91", for if the sum s' + S' of a vector 
r' in 9T and a vector j" in 91" vanishes then £ = — £ is a 
vector which belongs to both sub-spaces and is consequently 
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perpendicular to itself, and this can only occur if £' = 0, A 
unitary correspondence which leaves 91' invariant will also leave 
91" invariant since the relation of perpendicularity will not be 
destroyed by such a transformation. In dealing with unitary 
correspondences or transformations it is therefore always possible 
to find an invariant sub-space SR" associated with a given invariant 
sub-space 9i', such that SR = +1R". The previous remarks 

about projection suggest that here in the unitary geometry we 
identify the space generated by projecting SR with respect to 
91' with the sub-space 9i" : we project on to the space 9t" per- 
pendicular to SR'. To this end we remark that among all vectors 
a in SR which are congruent mod. SR' there is one (a) which lies 
in SR" ; we then have 

(<*•«) — *(<*), (« + &) = (a) + (&)• 

With an arbitrary linear correspondence A 

1) -> t)' = A 1) : yi = £a ik y k (4.4) 

k 

of SR on itself is, as we have seen, associated a bilinear form 

£&ik£i Vk 
ik 

which depends linearly on a vector £ in P and a vector t) in SR. 
In unitary space we can therefore associate the form 

A(%, D) = £a ik x t y k , 

ik 

depending linearly on k) = (y { ) and £ = (£*), with the correspond- 
ence (4.4). It is in fact the scalar product of £ and At). The 
special case in which 

A = A or Afo, S) — A(i, t)) or a ki = a i1t (4.5) 

bears the name of the French mathematician Hermite. The 
correspondence (4.4) is consequently Hermitlan if the scalar 
product of £ with A t) is the conjugate complex of the scalar 
product of t) with A £. On identifying t) with £ we obtain the 
“ Hermitian form ” 

Afe) = Afg, S) = Za ik x t x k , (4.6) 

i.e, the scalar product of £ and A % ; in consequence of (4.5) its 
value is real. An Hermitian form or correspondence A is said 
to be non-degenerate if there exists no vector £, except £ = 0, 
whose transform A% vanishes It is positive definite if the value 
of the form A(f) >0 for all vectors £ #= 0 ; a positive definite 
form is non-degenerate. 

The fundamental metric form (4.2) is one such positive 
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definite Hermitian form, the “ unit form,” the coefficients of 
which consist of the numbers 


$ik — 


fl (i = k) 
to (i 4= k)' 


On introducing an arbitrary co-ordinate system a* (i = 1, 2, * • •, n) 
into the n-dimensional space, the absolute magnitude of an 
arbitrary vector 


is given by 


S = *i ai + **a 2 + - • • + x n a n 
t = 2ga gik - (a, a, £ ) . 


The expression for £ 2 is accordingly always a definite Hermitian 
form ; conversely, any positive definite Hermitian form G(j) 
could be taken as the fundamental metric form. To show this 
we employ the associated Hermitian bilinear form G($, ty) to 
carry through the following procedure, which is patterned after 
the step-by-step construction of a Cartesian co-ordinate system. 
Choose any non- vanishing vector e x ; since G(t x ) > 0 we may, 
on multiplying e x by an appropriate numerical factor, normalize 
it in accordance with the equation {r(e x ) = 1. When the process 
of constructing a system of unitary-orthogonal vectors t x 

G(e i} e. k ) = 8 ik 


has been carried through m steps, i = I, 2, • * •, m, the next 
step is accomplished by choosing a solution £ = e., n+1 of the 
m <n homogeneous linear equations G(e { , j) = 0 for the n 
unknown components of the vector J 4= 0 and normalizing it 
in accordance with the equation G(e m+1 ) = 1. The procedure 
comes to an end after n steps ; we then have n vectors 
e n * • *, e* of such a kind that 


where 


E) = %i x i + ^2^2 + • ‘ + x n x n 

5 = x x e x + x 2 e 2 + * * • + % n 


It follows from the equations themselves that £ can only vanish 
when all of its components x t vanish, and consequently the e { 
are linearly independent and constitute a co-ordinate system 
in SR. 

The transition from affine to metric geometry can accordingly 
be accomplished by the introduction of the axiom : 

(S) The square of the absolute magnitude of a vector £ is a real 
number £ 2 which is a positive definite Hermitian form in the 
components of £. 

These last considerations are useful in another connection. 
If is a linear sub-space of we can employ the construction 
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used above to find m vectors e 1} e 2 , • • *, e m in JR' which span 9i ; 
and are mutually unitary-orthogonal in the sense of the equations 
(e t -e fc ) = Sf fc . By continuing the construction we can supplement 
these m fundamental vectors by n — m additional ones 
fcftt-H) • • e w so that the two sets together form a co-ordinate 
system for the entire space SR. We can therefore adapt our 
normal co-ordinate system to the separation of SR' out of SR or 
to the decomposition of SR == SR' + 91" into two perpendicular 
sub-spaces. 

Since the correspondence A of SR on to itself is invariantively 
connected with the Hermitian form A in SR, we may speak of 
the product BA of two Hermitian forms A , B in SR, but this 
product is not in general Hermitian as 

BA = AB = AB. 

The trace of an Hermitian form or correspondence A is real. 
The positive definite expression 

tr (AA) = £\a il6 \* (4.7) 

{, k 

is of particular importance. When Si is decomposed into 
mutually perpendicular sub-spaces Sia (a = 1, 2, • • •) the section 
Aap of the correspondence or form A in which intersects Sip 
is uniquely determined ; it is a correspondence of Sip on 9t«, 

and Apx, the /la-section of A, is a correspondence of 3i« on Sip. 
When the co-ordinate system is adapted to the decomposition 
of Si we have 

tr (Aap Apa) — tr (Aga Aap) — 2? | a ilc | 2 (4.8) 

where in the sum i runs through the a tt , k through the /1 th set 
of indices. 

Any non-vanishing vector a determines a ray a which consists 
of all vectors of the form Aa, A being an arbitrary complex number. 
The generating vector a can be so normalized that its absolute 
value [ a | = I ; this does not, however, determine tt to within 
a change of sign, as in the real domain, as the normalization is 
unaltered on multiplying a by an arbitrary (complex) number e 
of modulus 1. We shall call the totality of vectors of 9R the 
vector field Si and the totality of rays the ray field Si. Any 
non-degenerate linear correspondence A of the vector field Si 
on itself is at the same time a correspondence of the ray field 
Si on itself, but this latter correspondence is unaltered by 
multiplication with any non-vanishing number. A unitary 
correspondence or transformation of the ray field on itself will 
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be briefly referred to as a rotation . By the symbol S' S we 
shall mean that the two transformations 5, S' of the vector 
field on itself differ only by a numerical factor s of modulus 1 : 
S' = sS, whence they both give rise to the same rotation of 
the ray field. 

§5. Transformation to Principal Axes 

The fundamental theorem on Hermitian forms is that con- 
cerning the transformation to principal axes . We are here 
concerned with the analogue of the familiar problem of finding 
the principal axes of an ellipse or ellipsoid in the ordinary 
geometry of two or three dimensions. We wish to find a normal 
co-ordinate system e t - associated with a Hermitian form A(%) such 
that in addition to 

£ = X x Z x “f" #2^2 "4" “4“ 

= XyX x + X 2 X 2 + • • • + (5-1) 

we also have 

A(l) = a x x x x x + a 2^12^2 + * • • + *»3 n *n 5 (5.2) 

that is, A shall be brought into the normal form (5.2) by means 
of a unitary transformation. The real numbers a 1? a 2 , • • a n 
are called the characteristic numbers of the form A, and 
e i» e 2) * * % e n the corresponding characteristic vectors . 

To this end we first consider the correspondence j j' = A% 
and seek those vectors j 4=0 which are transformed into 
multiples j' == Aj of themselves by A. We then obtain the 
“ secular equation ” 

/(A) sa det (A1 — A) = 0 

for the multipliers A. According to the fundamental theorem of 
algebra this equation certainly has a root A == a x ; corresponding 
to it a non- vanishing vector 5 = e x can be found which satisfies 
the equation At } = a . x t l9 and on multiplying this vector by an 
appropriate numerical factor we may take it such that its modulus 
is unity. e x can then be supplemented by n — 1 further vectors 
e 2 , • * •, e n in such a way that these « vectors constitute a normal 
co-ordinate system. In these co-ordinates the formulae 

e/ = Ae.i = 2J a ki^k 

k 

for the correspondence A require, in accordance with the 
definition of e x , that the coefficients a 2h a 31) * * *, a nl vanish and 
that a n == ol x . Because of the symmetry conditions a ki = a ik) 
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a 12 , a ls , * • *, a ln must also vanish. Hence in the new co-ordinates 
the matrix A assumes the form 


*1 

0 

0 • 

• • 0 

0 

#22 

#23 * 

• ’ a 2 n 

0 

#32 

#33 

• 4 

0 

#n 2 

#713 ‘ 

#7171 


and the Hermitian form becomes 

A(jc) = «!*!% + A'(t) (5.3) 

where A' is an Hermitian form containing only the n— 1 variables 
x 2 , x 3 , • • •, x n . Repeating this process, or calling on the method 
of mathematical induction, we establish the validity of the 
fundamental theorem stated above. 

The characteristic polynomial of (5.2) is 

det (A1 — A) = (A — a x )(A — a a ) • • • (A — a„). 

From this it follows that the characteristic numbers a 1( 
a 2 , • • •, a„, including their multiplicity, are uniquely deter- 
mined by the Hermitian form A ; their sum is the trace of A. 
What can we say concerning the characteristic vectors ? Let 
a be a given real number ; the vectors £ which satisfy the equa- 
tion A% — a£ constitute a linear sub-space 9?(a) of 91, the 
characteristic space belonging to a. When the normal 
co-ordinate system e< is so chosen that A is in the normal form, 
the equation A% = a£ is, in terms of its components, 

OLiXi = UXi 

from which it follows that 9 R(a) is spanned by those vectors e< 
for which a< = a. If, for example, the three roots x lt a 2 , a 3 = a 
while all the others are different from a, the characteristic space 
jR(a) is 3 -dimensional. If none of the characteristic numbers 
is equal to a, Sft(a) consists only of the vector 0 . This again 
characterizes the characteristic numbers, including their multi- 
plicity, in a way which is independent of the particular co- 
ordinate system chosen, and in addition it characterizes the 
corresponding sub-spaces 91(a). 9i is thus decomposed into the 
characteristic spaces 9 i(a) : 91 = ^9i(«) ; only a finite number 

Ot 

of terms occurs in this sum, i,e. those for which a is a character- 
istic number of A. A complete co-ordinate system e a , e 2 , * * •, e n 
for the entire space SR can be obtained by choosing a normal 
co-ordinate system in each non-null sub-space SR(ot). The 
normal form (5.2) is undisturbed on subjecting the variables 
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associated with the same characteristic number a t * = ot to an 
arbitrary unitary transformation. 

If, for example, a is a triple characteristic number 

H = = <x 3 = a 

^hile the remaining oc f 4= a, then x x e x + x 2 t 2 + is the 
normal projection of the vector j on $ft(a) and 

= Mi + M2 +• Ms 

is the scalar product of with itself. The equations (5.1), 
(5.2) may then be written in the invariant form 

£ 2 = I EM A® = * E&). . (5.4) 

a a 

$ft' being a sub-space of 5ft, any vector £ can be uniquely 
broken up into j' + £ 0 where £' lies in 5ft' and J 0 is perpendicular 
to 5ft'. The “ orthogonal projection” £->j' = E'j is a linear 
correspondence which obviously has the property 

E'E' = £', (5.5) 

for the projection of $' on 5ft' is simply j' itself. Furthermore, 
the operator E f is Hermitian , for the scalar product of i) into £' 
is equal to the scalar product of 1)' into £', where fy' is the projection 
of b on 5ft'. (The Hermitian form E*($) is accordingly the square 
of the absolute value of j'.) We shall call Hermitian forms 
which satisfy equation (5.5) idempotent. 

When the sub-spaces 5ft' , 9?" are orthogonal, the two corre- 
sponding projection operators £', E" satisfy the equations 

E'E" = 0, E"E' = 0, (5.6) 

for E f (E"£) is the component of E"£ lying in the space 5ft' per- 
pendicular to £"£. Idempotent operators which satisfy these 
equations are said to be independent. The second equation is, 
moreover, a consequence of the first, as may be seen on going 

over to the Hermitian conjugate : £"£' = 0. If 5ft is decom- 
posed into several mutually orthogonal sub-spaces 5ft' 4- 5R"-+ • * *, 
then 

£ = £'£ + E"i +-•••. (5.7) 

It is easily shown that the converses of all these assertions 
are also valid. If E' is an idempotent operator and E" = 1 — £', 
all vectors of the form jE'j constitute a linear sub-space 5ft' and 
all vectors of the form E" £ a sub-space 5ft". The equation 
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shows that the scalar product of a vector E'% in 9ft' and a vector 
jE" 1} in 91" is zero : xE’E"y 0. The decomposition of a 
vector % into a component lying in 9?' and one perpendicular 
to JR' is accordingly expressed by 

S«£'S + (1-B')S. 

If the two idempotent forms £" satisfy the equation (5.6) 
then, as we have just seen, the two corresponding characteristic 
spaces 91', 91" are mutually perpendicular. If the sum (5.7) 
consists of independent idempotent forms, then by the above 
the corresponding mutually perpendicular sub-spaces 9^', SR" 
exhaust the entire space 91. 

The theorem on transformation to principal axes can accord- 
ingly be stated : An Hermitian form A associates with the real 
numbers a mutually independent idempotent Hermitian forms E<x 
such that 

1 == A = 2Ja • Ex ] (5.8) 

a a 

E a is non-vanishing for only a finite number of values a. 

A correspondence A can be reiterated : 

AA = A\ A* A = A* • • • 

and we can accordingly obtain polynomials 

f[A) — c 0 l + C\A + c t A* -j- • • • + c h A k 

in A with numerical coefficients c. On reiterating (5.8) h — I 
times 

A h = 

a 

whence for the general polynomial / 

fi A ) = (5.9) 

a 

The characteristic numbers of f(A) are therefore the values of 
the polynomial /(a) for the characteristic numbers a of A. This 
suggests defining the Hermitian form f(A), where /(a) is any 
real function of the real variable «, by means of the equation. 
(5.9) _ 

Given two Hermitian forms A, B, under what conditions can 
they be brought simultaneously into diagonal form, i.e. when is 
it possible to find a normal co-ordinate system in which 

A $ = a A*, + W, + • • • + a„x n x n 
"(E) — Pl%l x l + + * • • + )3 n * n*« ? 


(5.10) 
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A necessary condition is that they commute : BA == AB, for if 
A and B are in the normal form (5.10) BA as well as AB is 
the diagonal matrix with elements ^oc t = a This condition 
is also sufficient ; to prove this, chQose a normal co-ordinate 
system in which A is already in normal form. The equation 
BA = AB requires that the matrix B = ||^ i3fc || satisfy 

&<*a* == a< b ik or (a,- — a k )b ik = 0. (5.11) 

We divide the indices i , the fundamental vectors e* and the 
variables into classes by considering i and k to be of the same 
class if a* = a*. Equation (5.11) states that b ik = 0 when 
i and k belong to different classes. B is consequently decom- 
posed into smaller matrices B\ B " aligned along the principal 
diagonal, corresponding to the way in which the a # are distri- 
buted in classes a', a", • • • ; the correspondence B consequently 
leaves each of the characteristic spaces 5ft(oc'), 5R(a"), • * • of A 
invariant. But we can then choose a normal co-ordinate 
system in each of these characteristic sub-spaces 5ft(a) in such 
a way that the Hermitian correspondences B B " in them are 
referred to principal axes ; the normal form of A is undisturbed 
by this procedure. 

This process can immediately be applied to any number of 
Hermitian forms : Any number of Hermitian forms can be brought 
simultaneously into normal form if and only if they commute 
with one another . By a slight modification we can further 
extend this theorem to an arbitrary finite or infinite system Z of 
Hermitian forms . This will be briefly discussed here, although 
in general the consideration of systems of forms or correspond- 
ence is postponed until Chap. Ill Let the space SR be decom- 
posed into mutually perpendicular sub-spaces 5ft', 5ft", • • * in 
such a way that each correspondence of the system Z takes 
place in these sub-spaces ; on adapting the co-ordinate system 
to this decomposition each Hermitian matrix A of Z consists 
of sub-matrices A\ A f \ - * • aligned along the principal diagonal. 
If all the A f are already multiples of the unit matrix 1 in 5ft' 
and similarly for all A'\ * * *, our goal is reached, for each corre- 
spondence A of the system then transforms 5ft' into itself and 
is a simple multiplication in it ; similarly for 5ft", • * \ But if 
this is not the case let A be a correspondence of the system 
which is not merely a multiplication in the sub-space 5ft'. On 
transforming the constituent A' of A to principal axes, 5ft' is 
decomposed into characteristic spaces 5R*' + 5ft a ' 4* • • • of A\ of 
which there are at least two. For any Hermitian matrix X 
of Z we have A'X' — X'A from which it follows, as we saw 
above, that X'- transforms each of the sub-spaces 5ft*', 5ft 2 ', * * * 
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into itself. The decomposition SR' + SR" + * * - can thus be 
further reduced to the decomposition (SR/ + SR 2 ' + * * * ) + 
SR" + • • \ Proceeding in this way we finally reach our goal 
after at most n steps, proving : 

The Hermitian forms of any system 2 can be simultaneously 
referred to principal axes if they all commute with one another . 

The theory developed above for Hermitian correspondence is 
valid as it stands for unitary transformations . S being any unitary 
operator , a normal co-ordinate system e* can be introduced in such 
a way that S carries each of the fundamental vectors e* over into 
a multiple <r *e* of itself. The characteristic numbers of 5 are 
numbers of modulus 1. In these co-ordinates the matrix of S 
is a diagonal matrix, the elements in the principal diagonal 
of which are the numbers or*. 

The proof is quite analogous. We again start with the 
secular equation 

det (at — S) = 0 

and consider the root cr x . There then exists a vector e x of modulus 
1 which is transformed into cr^ by the correspondence S . Sup- 
plement e x with n— 1 further vectors e 2 , • • •, e n so that these n 
vectors form a normal co-ordinate system. In these co-ordinates 
the matrix \\s i7c \\ of the correspondence 5 : 

Se* = £s ki e k 

k 

is again of the form 

= ^i, s 21 = • * • = s nl = 0. 

Since S is unitary the sum of the squares of the moduli of these 
elements of the first column must be unity, whence |or l | = 1. 
Similarly the sum of the squares of the moduli of the elements 
in the first row must also be 1 : 

Kl , + M*+- • - + h„|* = i; 

but since |<r 1 | a = ] it follows that 

^ 12 = • • • = ^ ln = 0. 

The matrix >S is now broken up into a 1-dimensional <r 1 and 
an (n — l)-dimensional S' as in (5.3) ; the truth of the above 
theorem then follows immediately by induction. 

The further results can be obtained in exactly the same way 
as above for Hermitian forms. The characteristic numbers o*, 
including their multiplicity but not their order, are uniquely 
determined by S ) and similarly for the corresponding sub-spaces. 
If we wish to find a linearly independent system of character- 
istic vectors, the fundamental vectors of each such sub-space 
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may be taken as forming a normal co-ordinate system. Finally, 
a finite or infinite set of unitary transformations can be simul- 
taneously reduced to normal form if and only if they commute 
among themselves. 


§ 6 . Infinitesimal Unitary Transformations 

A rigid body in continuous motion about a fixed point 0 
performs an infinitesimal rotation in each interval dr of time. 
Denoting by (dx j, dx 2) dx z ) the infinitesimal displacement of 
that point of the rigid body which is at the point P(x 1} x 2 > x z) 
at the time r, the equations of motion of the body must be of 
the form 

dr, - 

*i = ^r = Zc ik x k (6.i) 

dr * 

in which the coefficients c ik are constants, i.e. independent 

of the particular point P under consideration. Employing a 

Cartesian co-ordinate system with 0 as origin, x x 2 + x 2 2 + # 3 2 
must remain unchanged throughout the motion ; this requires 
that 

JJXi ^ = 0 or £c ik Xi x k = 0. 

i dr i h 


Since this equation must be satisfied identically in the x ir the 
matrix C = \\c ik \\ which characterizes the motion must be anti- 
symmetric : Cjd = — c ik . Introducing the vector I with origin 
at 0 and terminus at the point P, and the vector c= (c 23j ^3i, ^12), 
equations (6.1) become 



the familiar fundamental formulae for the kinematics of a rigid 
body. The square brackets denote the vector product and C 
the vectorial angular velocity, the absolute value and direction 
of which give the angular velocity and direction of the axis of 
rotation respectively. 

The continuous compounding of interest offers another 
example of an infinitesimal linear transformation. The interest 
rate being c, a real number, the increase in the capital x in time 
dr is xcdr. Radioactive disintegration is the same kind of a 
process with negative c . The capital #, considered as a function 
of the time, satisfies the equation 


( 6 . 2 ) 
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and consequently increases exponentially with r. If the prin- 
cipal has the value x 0 at time r = 0, it will have increased to 

x(r) = x 0 • e* 

at time t. To obtain an alternative solution we divide, as in 
the method of finite differences, the time interval r into a large 
number n of equal elements rjn ; x will increase by xcr/n in 
each of these intervals .and the capital x will accordingly be 
multiplied by (1 + ct/w)" at the end of time r. The familiar 
definition 

•— .‘ri 1 + s)' < 6 - 3 > 


of the exponential function follows from a comparison of these 
two results. But we can also solve the differential equation 
(6.2) by the method of successive approximations. We take as 
the 0 th approximation the initial value x 0 : x 0 (r) = x 0 . The 
(n + l)st approximation is obtained from the n ih by substituting 
the latter in place of x on the right-hand side of (6.2) and 
integrating : 

r 

x«+i(r) — x 0 + cjx n (t)dt. 

0 

On carrying out this process we find 



+ -+ 
r 1! + 


+ 



from which we obtain the familiar power series expansion 

n 1 \ Cr , («r)* , 

e cr 1 -\ 4- v — L • • • 

^ 1 ! ^ 2 ! + 


(6.4) 


for the exponential function. The convergence of (6.3) and 
(6.4) and the identity of their limits is rigorously proved by 
elementary analysis. 

These examples will assist in understanding the concept of an 
infinitesimal unitary transformation of the n-dimensional 
space 8t = SR„, which we now proceed to introduce. In order 
to avoid the use of infinitesimals we introduce a (purely fictitious) 
time r and think of the infinitesimal linear correspondence which 
carries the vector % over into j + dj as talcing place in the time 
interval dr : 


d l 

dr 


= Cl, 


dx ( _ 
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(For the sake of brevity we refer to this simply as “ the in- 
finitesimal transformation C.”) Since the transformation is 
unitary, on employing a normal co-ordinate system gXi %i must 

i 

remain unchanged : 

+ -p-t" 0 - <«> 


On setting 


= Zc ik x h , 

k 




the left-hand side of (6.5) reduces to the Hermitian form 

Z[ c ik H“ Cki)%i%k 
i,k 

and since it must vanish identically in the # t * we must have 
c ik + Cm = 0, or the transformation C is anti-symmetric in 
the sense of the equation 

= — c ki} C = — C. (6.6) 

In the real domain there exists no intimate relationship between 
symmetric and anti-symmetric matrices, but the situation is 
different in the complex domain. For on setting C = iH (i being 
the imaginary unit V — 1) it follows from (6.6) that H satisfies 

the equation H = H, and C is consequently i times an Hermitian 
matrix. In an infinitesimal unitary rotation of a vector field the 
dx 

velocity ~ is related to J by means of a correspondence whose matrix 

is i times an Hermitian matrix. The theorem on transformation 
of Hermitian forms to principal axes is accordingly the limiting 
case of an analogous theorem on unitary transformations. 

By repeated application of the infinitesimal unitary trans- 
formation 

di^ir-Cl (6.7) 

we obtain after time r 

S~>S(t) = C7(t) S = ^5 (6.8) 

where the exponential function e A for a matrix A can be defined 
by either 

A“( 1 + r)‘ 


or the power series 


Naturally 


1 , A A 2 

1 + n + 2i + ■ "• 

U(t + t') = U{t) U{t'). 



' fi 

i; 

' 'I 


jti 


30 UNITARY GEOMETRY 

Accordingly U(r) runs through all the transformations of a 
1-parameter continuous group of unitary transformations gener- 
ated by the infinitesimal transformation C ; the parameter t is 
additive on composition. The power series is obtained by the 
method of successive approximations ; this method can also 
be applied to obtain a solution in the more general case in which 
the infinitesimal unitary transformation C is not the same for 
each time element dr, i.e. in which C is a matrix CM depending 
on the time r. The solution of the equation s 

1=^)5 

for this case is given by 

sfo) = Ctyvitefo) ; 

the unitary transformation U{r t T^ which takes place in the 
time interval r 1( r 2 obeys the law of composition 

U ( T » T i) = U(t z t^ U(t 2 t^). 

If j = jo at time r — 0, the formulae for the successive approx- 
imations j, (t) are 


T 

So W = So ; Si +1 (r) = j 0 -f J C(t)%i(t)dt ; 

0 

for U(r) = U(r 0) we obtain the infinite series JJU, (r) in which 

1-0 


(6.9) 


T 

Uoi r) = 1 ; U l+1 (r) = f C{t) U l {t) dt. 

Written explicitly, 0 

U,( T ) = ff • • • JC(f I )C(f 2 ) • • • C{ti)dt x dt t • • • dti. 

(0 £ £ 1 * • £» fj , :£ r ) 

The proof of the convergence of this process is readily ob- 
tained with the aid of the quantity | A | associated with a matrix 
A = || a ik || by the equation 

Ml a =tr (AA) = Z\ a (k |* 

i,k 

It follows from the well-known Schwarz inequality 
I a i H~ #2 ^2 4- • • * -f- a n b n | 2 

that ~ "* I 2 + * * • + ( |*) (6.10) 

.... lA + B I ^ \A\+ | B\ 


\AB | £ \A\ \B\. 


and that 
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The second inequality is obtained by applying (6.10) to the 
element 

£ ik — fork 

of C = AB : 

k«|* g Z\<t ir \ 2 - Z\b rk \* 

r r 

and summing with respect to i and k. The first inequality may 
be stated in the form 

t r 

|Ji4(0*| ^ \\A{t)\dt. 

0 0 

for integrals. The convergence of ZU X (r) can now be established 

with the aid of these auxiliary results, for we can prove that 
under the assumption 

| C{t) | (0 gtgr) 

that 

| Ui(t) I s Vn- 

For this is certainly true for l = 0, and the recursion formula 
(6.9) enables us to conclude that it holds for U M if it holds for 
U h The convergence follows from this absolute convergence, 
for the absolute value of each component of the matrix A is 
certainly not greater than | A |, 

We have only gone into these matters to reassure the reader 
of the legitimacy of dealing with infinitesimal quantities of the 
kind met here. The only thing of importance for the following 
is the simple relation existing between infinitesimal unitary 
transformations and Hermitian forms. 

§ 7. Remarks on oo-dimensional Space 

The unitary spaces which appear in quantum mechanics 
usually have an infinite number of dimensions. Such a space 
consists of all vectors 

S — (#1, x t> ' • •) 

whose components x { constitute an infinite sequence of numbers 
for which 

J 2 = X^Xx "T "T 

converges. Within this domain addition and multiplication 
with numbers, as well as the construction of the scalar product 
of two vectors, are possible. All the axioms employed so far 
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are satisfied, with the exception of the dimensionality axiom y 
introduced in § 1. 

Since the vector components x 1} x 2l • • • constitute a de- 
numerable set, this “ Hilbert space ” has a denumerably infinite 
number of dimensions. But in addition to these, spaces of 
non-denumerably infinite dimensions may occur. Consider, for 
example, all continuous complex functions *ft(s) of a real variable 
s of period 2 tt. We need not distinguish between two values of s 
which are congruent mod 2rr, i.e. whose difference is an integral 
multiple of 2tt ; it is consequently more convenient to consider 
as a function defined on the periphery of the unit circle than on the 
straight line. The various values of s at points on the circum- 
ference play the r61e of indices, the value ifi(s) at the point ^ being 
the component of the “vector iff” with index s. The totality 
of such functions t/j(s) therefore constitute a linear “ function 
space ” of continuously infinite dimensions. Addition of these 
vectors and multiplication by a number have here the same 
interpretation as in the ordinary operations with functions. 
The square of the absolute value of the vector t/t is taken to be 

{<!•, <l>) = l*Ks)>fi(s)ds 
0 

and the scalar product of two vectors </> and \jj as 

0 

A set of functions 

<f> l{s), M s ), * • *i ^n(s) 
constitutes a. unitary-orthogonal system of vectors if 

l$i[ s )4>*[ s )ds = 

o 

These vectors span an n-dimensional sub-space St* of the oo-di- 
mensional function space, i.e. that sub-space consisting of all 
vectors of the form 

<K S ) = *1 this) + x*h(s) + • • • + X n <f, n (s). 

x u x t> '*•»*» are the components in the co-ordinate system 
<f>i, h, •■•,<!>* of the vector <f>{s) in 9t„. We have 

2ft 

(4>, <f>) = \^)<f>(s)ds = x x x t + *»*, + ••• + x n x„. 

0 



33 


REMARKS ON oo-DIMENSIONAL SPACE 

An arbitrary vector ip can be broken up into a component <f> 
which lies in 2ft n and a component tfi' perpendicular to SKn * 

tfs s <f> + if}', 

n 2 n 

4>{s) — E Xi <f>i{s), fa[s)ft(s)ds = 0. 

1 0 

It follows from these equations that [cf. (4.3)] 

2n 

Xi = J $i(s)<fi(s)ds. 

0 

These integrals are called the Fourier coefficients of the function 
ip with respect to the orthogonal system 0 t -. The orthogonal 
projection <p on 5R n cannot be longer (i.e. have greater absolute 
magnitude) than ip itself ; this is the content of the so-called 
Bessel inequality 

+ * 2*2 +'■•* + x n x n Si ^(s)ip{s)ds. (7.1) 

0 

In fact, since (<£, ip') = 0, (i p ' , <p) = 0, the “ Pythagorean theorem” 

Ws <A) = (<A, <A) + 0') 

holds. 

The simplest unitary-orthogonal system in the domain of 
periodic functions, with which the theory of Fourier series is 
concerned, consists of the functions 


^Le(ns) [» = 0, ;fc 1, ± 2, • • • ; e(x) = e ix ]. (7.2) 

This infinite system has the property of completeness ; it 
is a complete co-ordinate system for the entire function space. 
The theorem that any periodic function ip(s) can be expressed 
as a linear combination of the functions (7.2) : 




1 

's/ 2 tt 


+ 00 


x ” ■ e ( M5 )> 


n » - oo 


2jt 

0 


(Fourier expansion of ^(s)) is true only if certain conditions 
concerning the differentiability of p(s) are fulfilled, but any 
continuous function satisfies ParsevaVs equation 


2 * 


j p(s)ip(s)ds = 


-{- oo 


=» -oo 


(7.3) 
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We learn from this example that there is no essential distinction 
between spaces of a denumerable and of a non- denumerable infinitude 
of dimensions ; we have introduced into our function space 
a complete normal co-ordinate system (7.2) consisting of ^ 
denumerably infinite set of fundamental vectors. In an 
dimensional unitary space a system of unitary-orthogonal 
vectors is complete if their number is but not if it is less - 
however, such an enumeration gives no criterion for oo -dimen- 
sional space. If we leave out a finite number of the functions 
(7.2) we still have an infinite set left, but the completeness of the 
system is destroyed thereby. The real criterion for complete- 
ness lies in the validity of the completeness relation (7.3). 

We can understand the relations existing in Hilbert space 
by analogy with or as limiting cases of those existing in spaces 
of a finite number of dimensions. If we consider the values of 
an arbitrary periodic function i/j(s) only at the points 


s 




s ' n 


and set 



we are dealing with an rc- dimensional vector space in which the 
components of the arbitrary vector i/t are these quantities 
i v {v = 0, 1, • • •, n — 1). Let e A be the vector in this space 
with components 



0, l, • • % n -1]; 


these vectors e K (A = 0, 1 , • • *, n — ■ 1) constitute a normal co- 
ordinate system for the space, relative to which the vector £ 
has the components x 0 , x lf • • •, which are to be calculated 
from 


— l 



A«0 

Ip accordance with (4.3) 


K = 0 


whence 
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By passing to the limit n -> oo we obtain the equation of ParsevaL 
We do not concern ourselves here with the further considerations 
which may be necessary to establish a rigorous proof, but content 
ourselves with such reasoning by analogy. 

We consider the linear correspondence or “ operator ” 
1 d 

D = -r — which transforms a function \ft[s) in the domain of 


periodic functions into 


e(ns) is the characteristic vector 


(characteristic function) of this operator belonging to the 
characteristic number n : 

1 deins) . v 

~ — i — l — n • eins). 
x ds v ' 

This operator is Hermitian ; the scalar product of <f> and Di/j 
is the conjugate complex of that of t/f and D<j>, where <f> and 0 
are any two periodic functions, for by partial integration 

2n 




and the right-hand side is conjugate to 

2 n 

1 JA 4s. 




ld£ 

i ds 


In fact, the Hermitian form 


assumes the normal form 




oo 

Z nx n x n 

ia a-00 


( 7 . 4 ) 


in the normal co-ordinate system whose fundamental vectors 
are the characteristic vectors of the operator D. The reiterated 
d 2 

operator DD = — appears in the theory of the vibrating 
string, together with the corresponding Hermitian form 


\r 


2 n 


f d A d lds 
)ds ds 
0 0 

which represents the kinetic energy of the string. 



36 UNITARY GEOMETRY 


We have here been dealing with a discrete spectrum of char- 
acteristic numbers . But in an oo- dimensional space Hermitian 
forms with a continuous spectrum can also be constructed. 
Consider, for example, the function space consisting of all con- 
tinuous functions ^(s) defined in the interval — rr ^ s g + n ; 
the square of the absolute magnitude of the “ vector ” if/ is then 

(«A, <W = f${s)<p(s)ds. 

— n 


The Hermitian form 

+ ft 

= §sf(s)tp(s) ds (7.5) 

— ft 

is already in normal form, which shows that it has as character- 
istic numbers all numbers between — n and + tt. The functions 
(7.2) again constitute a complete normal co-ordinate system in 
terms of which 

+ 00 

<P(s) ~ *• «M- 

n —— oo 

Substituting this in (7.5) we find 

4 - n 

A[>l>] = Za mn x m x n , a mn = ^se[— ms)e(ns)ds. 


+ n 


The evaluation of 

p * e[(n — m)s]ds 

— n 

yields 0 when n = m and by partial integration 
I , . j[( n-m)5] -| + " = l)-" 


[' 


t(n — m) _L ; 
when n + The Hermitian form 


i(n — m) 


m 


(- 1 )- 






has therefore as characteristic numbers all values between 
— it and -f- rr. 

The characteristic vector tf/ a belonging to the characteristic 
value a (~ it oc nr) of A[t(/] is that function which vanishes 
at all points s 4= & and is there so large that the integral of 
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fi* has the value 1. Of course such a function does not really 
exist, but we can approximate it as closely as we wish. In 
order to arrive at a formulation which is mathematically rigorous 
for the case of continuous spectra, we must introduce in place 
of the idempotent Hermitian form in (5.4) the idempotent 
form A E = £ E x for the entire interval A = Af (a g A < jS). 

For any given vector £ 

A m ^ 0, A£E(e) + A J£(S) = A yJE(t (7.6) 

and the idempotent forms A E associated with two separated 
intervals A are mutually independent. 

In dealing with the continuum, the sum in (5.4) is replaced 
by a Stieltjes integral. Consider the straight line described by 
the real variable A as being covered with a substance, and let 
the amount of this substance on the interval A be denoted by 
Aw. We then have, in analogy to (7.6), 

Am ^ 0, Afm -f- Ajjm = A y jn. 

If (f>( A) is a continuous function of position we can construct 
the integral 

i 

\<HX)dxm. (7.7) 

0 

An approximation to this integral can be found by dividing the 
entire interval 0 <£ A ^ 1 into small intervals A*-, choosing a 
point A t - in A, and evaluating the sum ’ A <w. This sum 

i 

then converges to the integral on allowing the A,- to approach 
zero. If the distribution has a continuous density 

lim 7R = 

1 

the integral is identical with jV(A)/>(A)dA. But the Stieltjes 

o 

integral (7.7) also includes the cases in which there exists no 
finite continuous density ; in particular, it allows the existence 
of discrete points at which a finite amount of the substance is 
concentrated. If the substance is distributed over a finite 
number of points A = a t * in amounts m i} the Stieltjes integral 
reduces to the sum JF^(a*)w *. 

t 

We thus arrive at the following more inclusive formulation 
of the fundamental theorem concerning the transformation to 
principal axes : (1) The Hermitian form A associates with each 
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interval A an idempotent form A E(%) ; (2) when two adjacent 
intervals A 1; A 2 are added, together to form an interval A, 

A E = A jE + A t E, 

and the idempotent forms associated with separated intervals are 
independent ; (3) we have 

I 2 = A[i) = f A • d x E(i). 

— 00 — 00 

In this form the theorem is adapted to the appearance of con- 
tinuous spectra of characteristic numbers, and is particularly 
appropriate for the purposes of quantum mechanics (cf. II, § 7). 
The discrete characteristic numbers lie at those points where 
the monotonic increasing function Ai *>£(£) =£( A; 5) of A has 
a discontinuity. In our example (7.5) 

= \t(s)4,(s)ds ; 

a 

here ip must be taken as 0 outside the interval (— 7 r, + *•). 
The evaluation in terms of the co-ordinates x n is readily accom- 
plished. 

Consider the function space consisting of the totality of 
all functions tp(s) of a variable s f which assumes all values from 
— 00 to + 00, and which have a finite absolute magnitude 

(•A. $ = l${ s )<l>(s)ds, 

— 00 

i.e. which are “ integrable square. 1 * The characteristic functions 

associated with the linear correspondence ip(s) are again 

the functions e[vs) } but the frequency v can now assume all real 
values . The components of ip(s) are the quantities 

+ « 

M = lA (* )•(— vs)ds. 

— 00 

Fourier's integral theorem then allows us to conclude the validity 
of the expansion 

+ 00 

m = ^m\ e{vs)f[v)dv 

— 00 
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under certain assumptions concerning the differentiability of 
the function i/j(s) ; but in any case the completeness relation 1 

+ 00 + oo 

J j(s)<Ks)ds=$f( v )f( v )d v 
- 00 - 00 

is valid* We arrive at a somewhat different problem when we 
only require that the functions be such that fi[s)ifj(s) 

possess a definite mean value 

+ a 

lim 2^J ${s)*l>(s)ds = (tp, P ) ; 

— a 

this leads to the theory of almost-periodic functions developed by 
H. Bohr. 2 Here again the validity of the completeness relation 
can be established. 

The theory of the characteristic numbers of Hermitian forms 
in infinitely many variables has been developed by Hilbert and 
Hellingerf but it is applicable only to bounded forms 


A{l) = 

i,k 

i.e. forms whose values have a fixed upper bound when 

t = S 1. (7.8) 

i 

Indeed, without this assumption we cannot guarantee the 
convergence of A(%) in the entire domain (7.8) ; as an example 
consider the form (7.4), Znx n x n . That this form only converges 

n 

in a portion of the domain (7.8) is merely another expression of 
the fact that not every continuous function is differentiable. 
The situation is more favourable for unitary forms as they 
satisfy the condition that they be “ bounded ” in consequence 
of their very definition ; a unitary transformation is thereby 
to be taken as satisfying both of the conditions 

UU = 1 } UU =s 1. 

The theorem on principal axes has been proved rigorously for 
bounded Hermitian and for unitary correspondences in oo- 
dimensional space. A method due to A. Wintner 4 seems 
particularly appropriate for dealing with unitary correspond- 
ences ; it is based on the consideration of the discrete group of 
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all powers 17* of the given unitary transformation U, and deter- 
mines the monotonic increasing function E(\ ; j) of the real 
variable A (0 A s* 2 ir) by means of the equations 

C7»( S ) = jV^E(A; S ) (7.9) 

0 

(the problem of trigonometric moments). J. v. Neumann 5 has 
gone furthest in dealing with linear operators for which bounded- 
ness is not postulated. In accordance with § 6 with a Hermitian 
form A is associated a group of unitary correspondences U[r) 
depending on the real parameter r and satisfying the equation 

U(r + r') = U(r) C7(r'); (7.10) 

the study of this group is equivalent to the study of A, It is 
therefore perhaps appropriate to replace this latter for oo- 
dimensional space by the former, for no convergence difficulties 
appear in the domain of unitary transformations. We must 
therefore attempt to ’bring the operators C7(r), which are con- 
tinuous functions of the real parameter r' satisfying (7.10) 
simultaneously into the form 

2 n 

U{t] l) = \^d k E{ A;*). (7.11) 

0 

This is accomplished with the aid of Wintner’s method on re- 
placing the discrete parameter n in (7.9) by the continuous 
parameter t. The problem (7.11) bears the same relation to 
(7.9) as Fourier’s integral bears to Fourier series. 

In setting up a system of axioms for oo- dimensional vector 
space the axioms (a), (y3) of § 1 and the metric axiom (8) of § 4 
can be retained ; for the proper substitute for the dimension 
axiom (y) see, e.g., v . Neumann , “ Mathematische Begriindung 
der Quantenmechanik.” 8 

The algebraic and geometric tools developed in this chapter 
offer a natural medium for the expression of quantum mechanics ; 
they already hold a dominating position in the classical physics 
of continuous media. A masterly exposition of their mathe- 
matical content and application is found in the first part of 
Courant'Hilbert's “ Methoden der mathematischen Physik,’* 
2nd ed. (Berlin, 1930). 


CHAPTER II 


QUANTUM THEORY 


§ 1. Physical Foundations 1 


T HE magic formula 


hv 


( 1 . 1 ) 


from which the whole of quantum theory is developed, establishes 
a universal relationship between the frequency v of an oscillatory 
process and the energy E associated with such a process. The 
quantum of action h is one of the universal constants of nature 


h = 6*547 X 10~ 27 erg secs. 


It was first discovered by Planck at the turn of the century in 
the laws of black body radiation ; that is, radiation which is 
enclosed in a cavity and is in thermodynamic equilibrium with 
matter of a definite temperature, which by emission and ab- 
sorption causes an exchange of energy between the various 
frequencies contained in the radiation. Since this equilibrium 
is independent of the particular nature of the matter involved, 
Planck considered, as a kind of schematic matter, a system of 
linear oscillators of all possible frequencies. A charge oscillating 
with frequency v interacts with the electromagnetic field by emitt- 
ing and absorbing radiation of the same frequency. Planck as- 
sumed that the exchange of energy took place in integral multiples 
of an energy quantum s ; he at first considered this assumption 
merely as a mathematical device, and intended to pass to the 
limit e — 0. In order to obtain agreement with the Wien 
displacement law, which was derived from general thermo- 
dynamical principles, the energy quantum associated with a 
definite frequency v must be taken proportional to v: e = hv. 
In this way Planck obtained his radiation formula, which is in 
excellent accord with observation ; according to it the amount 
of energy contained per unit volume in the spectral interval 
y, v + dv in thermodynamic equilibrium at temperature 6 is 


u(v)dv = 


Sirh^dv 


41 


( 1 . 2 ) 
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■where c is the velocity of light and k the Boltzmann constant 
(|fefl being the mean energy of an atom of a monatomic gas at 
temperature 0). On passing to the limit h = 0 we obtain the 
Rayleigh- Jeans radiation law 


u{v) = 



The assumption of the validity of this latter law for the entire 
spectrum is in gross disagreement with the facts, as it would 

lead to an infinite value for the total energy \u(v)dv ; a state of 
equilibrium would therefore be impossible with given finite 
energy. 

The idea of a quantized exchange of energy, which occurs 
in Planck’s derivation somewhat schematically and only in 
application to statistical thermodynamical consequences, was 
first seriously applied to individual atomic processes by Einstein. 
In 1905, guided by the observations of H. Hertz , Hallwachs 
and Lenard on the photo-electric effect , he enunciated the idea 
of a light quantum or photon as “ an heuristic viewpoint con- 
cerning the generation and transformation of light ” * according 
to which not only the exchange of energy between matter and 
radiation of frequency v occurs in quanta of amount hv, but 
further, light of frequency v can exist in the ether only in quanta 
of energy hv. The decisive experiments were first performed 
by Millikan ten years later. By allowing ultra-violet or X- 
radiation of frequency v to fall on a metal plate electrons are 
released whose kinetic energy (as was already known to Lenard) 
increases with the hardness (i.e. with decrease of wave-length) 
of the incident radiation ; the energy with which the electrons 
are emitted is, however, not influenced by the intensity of the 
radiation. The exact relation predicted by Einstein is 

hv-P = ^ = eV 

where — e, m and v are the charge, mass and velocity of the 
electron, respectively. The energy hv of the photon is trans- 
formed into kinetic energy of the electron, after subtracting 
from it the work P required to pull the electron out of the metal 
surface. If the potential difference between the metal surface 
and a plate placed in front of it is V the electron current will 

hv 

disappear as soon as V exceeds the critical value V 0 = — - 

6 

Millikan found that the potential at which the current vanished, 
obtained by extrapolation, was in fact exactly proportional to 
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the frequency v for monochromatic light of various frequencies, 
and that the constant of proportionality was equal to the 
quotient of the h obtained by Planck from black body radiation 
and the elementary quantum of electric charge e. The differ- 
ence of the mean energy P for two different metals is furthermore 
equal to e times their contact difference of potential. The 
value of P, or at least its order of magnitude, is therefore known, 
and we find that for X-rays of a few Angstroms wave-length 
(lA = 10“ 8 cm.) P is negligible in comparison with hv. The 
equation 

hv = ^ = eV (1.3) 

governs not only the generation of secondary cathode rays by 
primary X-rays, but also the inverse process : the transformation 
at the glass wall or on the anode of the incident cathode rays 
into the impulse radiation first observed by Rontgen . If an 
electron which has run through the potential drop — V in the 
X-ray tube loses its entire energy on collision, a photon of fre- 
quency v and energy hv = eV will spring into existence. The 
electron may, however, only be slowed down ; consequently 
v is only the upper limit for the frequency of the impulse radia- 
tion, which will therefore consist of a continuous spectrum with 
eV 

a sharp limit at v = The old classical theory of radiation 

was entirely unable to account for this most characteristic 
property of the impulse radiation. The frequency of the limit 
increases in proportion with the applied potential — and this is 
the exact formulation of the fact that “ the higher the potential, 
the harder the rays ” so familiar to every X-ray operator. 

The observed phenomena thus confirm the hypothesis that 
radiation of frequency v can be absorbed and emitted only in 
quanta of energy hv. This hypothesis will of course have further 
consequences for the theory of the structure of matter. The 
Planck oscillator will, for example, be unable to alter its energy 
continuously since it can only emit or absorb these fixed quanta 
of energy, and it will consequently spring to and fro on the rungs 
of its energy ladder, which are equally spaced at intervals hv ; 
v is here the frequency of the oscillator , a constant determined by 
the constitution of the oscillator. An application of the essential 
elements of this idea to actual atoms gave rise to the frequency 
rule enunciated by Niels Bohr (1913) : 

An atom can exist only in certain discrete stationary states 
(“ quantum states ”) in which it does not radiate. Light wilt be 
emitted on transition from one state into another ; the energy which 
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it loses in this transition , the difference E x — E t of its energy in 
the two states , will be transformed into a photon of energy hv i the 
frequency v of which is determined by the equation 

hv — E 1 — E 2 . (1.4) 

In this equation E lt E 2 may be any two of the discrete energy levels 
(E x >Ef). Conversely , in absorption a photon raises the atom from 
the energy level E x to a higher E 2 by giving tip its energy hv to the 
atom . 

According to classical electrodynamics an atom should 
continually emit radiation in consequence of the vibrations of 
its constituent electrons, and the frequencies of the emitted 
light should agree with the frequencies of the simple oscillations 
into which the motion of its electronic system can be resolved. 
But the atom will itself lose energy through this radiation, the 
motion of its electrons will thereby be modified and the fre- 
quencies will consequently be displaced. This entire point of 
view is therefore irreconcilable with one of the most fundamental 
physical facts : the existence of sharp spectral lines* On the 
other hand, Bohr’s assumption is not only in agreement with 
this fact, although it offers no such detailed picture of the 
reaction between matter and ether as the classical theory, but 
contains in addition the fundamental Ritz-Rydberg combination 
principle . If we order the energy levels in an increasing series 
E 0 < Ei < E 2 < • • •, then in accordance with (1.4) each 
frequency v is the difference of two “ terms ” v % = £,•/&, 

v[i-+k) = Vi — v k (i > k). 

Consequently there will occur in addition to the frequencies v(i -» k), 
v(k-+ 1) the frequency 

v(i ->/)== v (i k) + v(k -+■ l ) (1.5) 

obtained from them by addition . This combination principle is 
valid without exception in the whole of spectroscopy, in the 
optical region as well as in that of X-rays, and has proved to 
be a valuable guide in the classification of spectra ; it reduces 
the complex line spectra to the simpler term spectra. Un- 
fortunately the problem is made more difficult by the fact that 
not all lines corresponding to possible transitions i -> k need 
actually occur— not every term v { need “combine” with a 
given term v k for the conditions of excitation may be such 
that certain lines have zero intensity. The selection rules for 
the allowable transitions will therefore be contained in the 
rules which determine the intensities of spectral lines. The 
combination principle, or the Bohr frequency rule, determines, 
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so to speak, only the keyboard of the spectrum — which tones 
are really struck is dependent on the mode of excitation. But 
it will in general be possible under proper conditions of ex- 
citation, e.g. the influence of strong external electric fields, to 
bring out the lines which are not observed under ordinary 
conditions. 

In the “ unexcited ” or normal state the atom is in the stationary 
state of lowest energy E 0) and consequently only the lines of the 
“ series ” n -> 0, of frequency v n — v 0 (n = 1, 2, * • •), occur in 
absorption. The lowest of these 1 0 (i.e. with greatest wave- 

length), or more precisely the lowest which is not forbidden by 
the selection rules, is called the “ resonance line.” 

The simplest atom is that of hydrogen ; in it a single electron 
of charge — e revolves about a nucleus of opposite charge + 
The terms of the spectrum of atomic hydrogen are found by 
observation to be given by the equation 

- n = -- 2 ( 1 . 6 ) 

c n 2 

where R = 109700 cm.” 1 is the Rydberg constant (spectroscopists 
are accustomed to give the wave number rfc, the reciprocal wave- 
length, instead of the frequency v). The energy levels corre- 

Rhc 

sponding to these frequency terms are E n = r . To this 

n 

discrete term spectrum we must add the continuous spectrum 
E Si 0 ; the additive constant in the energy is so chosen that 
E — 0 separates the hyperbolic electron orbits from the elliptic. 
The Balmer series consists of the lines n 2 with wave numbers 



This is the oldest known series formula ; Balmer obtained it in 
1885 by abstraction from the first four lines of the series, called 
Ha, Hu, H y , H 6 , which lie in the visible region. The lines of 
this series converge with increasing n to a limit with wave 

number j (wave-length ^ = 3 6 50 A V is the work required 

to ionize an H-atom in the stationary state n — 2, i.e. the work 
required to remove the electron from such an atom without 
leaving it with kinetic energy. The continuous spectrum, 
arising from transitions which ionize the atom, will join on to 
this series limit on the short wave side. We are further ac- 
quainted with the Lyman series n ->• 1 which lies in the ultra- 
violet and also occurs in absorption, the Paschen series n -> 3 
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lying in the infra-red, and finally with some members of the 
Brackett (n 4) and Pfund (n 5) series in the far infra-red- 
In order to ionize hydrogen in the normal state an amount cRh 
of work must be done ; the corresponding “ ionization potential,’ * 
i.e. the potential difference an electron must traverse before it 
is able to ionize atomic hydrogen by means of its kinetic energy, is 

V _ cRh _ 13>53 volts 
e 

Bohr’s frequency rule goes beyond the combination principle 
in asserting that the terms are actually energy levels, an assertion 
irrelevant to and not verifiable by spectroscopy. That this is, 
however, in fact the case is confirmed by the experiments of 
Franck and Hertz on collision phenomena . 3 In these experiments 
electrons are given an amount eV of kinetic energy by allowing 
them to pass through an electric field of known potential differ- 
ence — V and are then allowed to pass through a gas consisting 
of the atoms which are to be investigated with the velocity thus 
obtained, without further influence from external fields. The 
electron can give up no energy to the atom until eV is greater 
than the excitation energy E ± — E 0 of the resonance line; if 

E x — E 0 < eV < E 2 — E 0 

then the electron can either suffer an “ elastic collision,” in 
which case it loses no energy, or it can suffer an “ inelastic 
collision,” in which case it loses an amount E x — E 0 to the 
atom. The electrons which have passed through the gas are 
of two kinds, those with kinetic energy eV and those with 
eV — (E x — E 0 ). When the atoms which have been raised 
from the state 0 to the state 1 by collision with electrons fall 
back into the normal state they emit the resonance line and, 
under the above conditions, only this line. This is fully con- 
firmed by the experiment. The kinetic energy of the emerging 
electrons is measured by introducing a retarding potential V' ; 
the electrons only come through it if their energy is greater 
than eV'. In general the electrons possess a discrete “ energy- 
spectrum ” after collision with an atom of the gas ; the possible 
energy values are 

eV n ' = eV- (E n - E 0 ) 

(n = 0, 1, 2, • * •, in so far as V n ' is still positive ; we here dis- 
regard the possibility that a single electron may suffer more than 
one inelastic collision). On allowing the retarding potential V' 
to decrease gradually from a value which is greater than V the 
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electron current decreases suddenly whenever V f passes through 
one of the values V ' 0 , V [ , • * \ 

Bohr’s frequency rule reduces the determination of spectra 
to the problem of obtaining the stationary states and the correspond- 
ing energy levels of an atom , i.e. of a mechanical system of known 
dynamical constitution. The example of the linear oscillator 
given above and the fundamental notions of the theory of 
oscillations suggest the following as a general guiding principle 
(P) : the frequencies derived from the energy levels by means 
of Bohr’s frequency rule shall correspond to the frequencies of 
the simple vibrations into which the actual motion of the atomic 
constituents can be resolved in accordance with the laws of 
dynamics. Such a resolution into simple oscillations is con- 
vincingly attainable in classical mechanics only if the system 
is “ multiply ” or “ conditionally periodic,” and for this case it 
was actually found possible to sharpen the general principle (P) 
into a definite rule for quantization. In the years 1913-25 the 
application of this quantum rule yielded a great harvest of 
results, and it seemed that we were in possession of the key that 
would unlock the mysteries of atomic processes. But the wards 
did not quite fit ; toward the end of this epoch its failure became 
more and more apparent and the physical theory was gradually 
reduced to a symbolic calculus of quantum numbers which had 
to be corrected each time a new fact was discovered. We do 
not wonder now that it ran such a course, but rather are surprised 
that it was as successful as it was 1 

From the beginning the quantum rules were a compromise. 
If a mechanical system of one degree of freedom undergoes a 
periodic motion the frequencies v of the simple vibrations into 
which its motion can be resolved are integral multiples of a 
fundamental frequency a>. This frequency depends on the 
energy of the orbit under consideration, and this latter is re- 
stricted by the quantum rules to the discrete set E n . The 
internal frequencies of the motion are therefore given by the 
formula 

v = k * o){n) (1.7) 

which depends on the two integers n and k. By the analogy 
with quantum mechanical frequencies this internal frequency 
(1.7) is to be ascribed to the jump n [n — k). The fact that 
v depends linearly and homogeneously on the jump k is expressed 
by the “ classical combination principle ” 

v(n -> n — k) + v(n n — l) = v{n -> n k 0 (1-®) 
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in. consequence of which frequencies with the same initial statu 
n will combine. But this is not in accord with the correct 
combination principle 

v{n n - k) + v{n - k-> n - k - t) = v(n -*n - k — I) (1.9) 

The changes k, l in the quantum number are here the same as 
in (1.8), but the final state n — koi the first frequency coincides 
with the initial state of the second ; only for quantum numbers 
n which are large compared with k and l does the classical 
principle agree asymptotically with the Ritz-Rydberg com- 
bination principle. Consequently if the general principle (P) 
is to be satisfied without compromise our mechanics must be 
altered in such a way that the false combination principle (1.8) 
is replaced by the correct one (1.9). In 1925 Heisenberg dis- 
covered a way in which such an alteration can be naturally 
accomplished ; in order to do this, however, it was necessary 
to give up the picture of an atom with its electronic orbits. 
The quantities with which the Heisenberg theory deals are 
only the frequencies and intensities of radiation associated with, 
transitions between the various states of the atom. 

It should tie observed that the correct combination principle 
(1.9) is in one important respect simpler than, the false one (1.8). 
As the formulation 

v(n" -> »') -+■ v[n' n) — v(n"^-> n) (1-10) 

shows, the quantum numbers serve only as distinguishing marks 
or indices which do not involve a law of composition, whereas 
the classical formula requires the addition of quantum numbers , 
which are therefore numbers on a definite scale. 

Another approach to quantum mechanics was discovered 
by L. de Broglie and E. Schrodinger* This approach seems to 
me less cogent, but it leads more quickly to the fundamental 
principles of quantum mechanics and to the most important 
consequences for experimental science. We shall therefore 
follow it, since we are more concerned in giving a short but 
comprehensive account than in giving a complete discussion of 
the physical foundations. The physical, essentially statistical, 
interpretation of the theory, with which Schrodinger has not 
been entirely in accord, is due mainly to M. Born. 

§ 2. The de Broglie Waves of a Particle 

We consider the undulatory character of light as guaranteed 
by the phenomena of diffraction and interference. Their most 
decisive feature is that with them we are dealing with the linear • 
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superposition of waves with arbitrary differences of phase . From 
the mathematical standpoint, they are characterized by the fact 
that they involve addition and multiplication with complex 
numbers, and we are consequently dealing with vectors in a 
complex space. We can, in fact, consider a complex function 
i l*(t ; xyz) employed in the description of the phenomena and 
defined over time and space as such a vector, where each space- 
time point represents one dimension of a complex vector space ; 
the differential laws for such a wave function ift — or for several 
such functions simultaneously, such as the components of the 
electric and magnetic field strengths — are linear and homo- 
geneous. But on the other hand the quantum phenomena 
which we discussed above speak just as plainly in favour of 
the corpuscular nature of light . The intensity of the mono- 
chromatic radiation employed in the production of the photo- 
electric effect has no influence on the velocity with which the 
electrons leave the metal ; it influences only the frequency of 
this event. Even with intensities so weak that on the classical 
theory hours would be required before the electromagnetic 
energy passing through a given atom would attain to an amount 
equal to that of a photon, the effect begins immediately, the 
points at which it occurs being distributed irregularly over the 
entire metal plate. This constitutes a proof of the existence of 
photons which is no less direct than the proof that oc-particles are 
of corpuscular nature by observing the scintillations caused by 
them on striking a sensitized screen. Further, if one considers 
the exchange of momentum in addition to that of energy in 
deriving the laws of black body radiation, conflict with Planck's 
hypothesis concerning energy quanta can be avoided only by 
assuming that in addition to the emission of the energy quantum 
hv a quantum hvjc of momentum is emitted in a definite direction , 
producing an equivalent reaction on the atom . 5 We here replace 
the continuous radiation of a spherical wave by the discontinuous 
emission of photons in definite directions which are irregularly 
distributed over the compass. 

We unite the two standpoints by retaining the linear wave 
equation , but considering the intensity pf as the relative probability 
that the photon appears at the point (x, y, z) at time t ; or, more 
precisely, that 


tji [ ft dxdydz 


(2.1) 


is the probability that at time t it will be found within the small 
parallelepiped with sides of length dx , dy, dz about the point 
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(x, y } z)* But we can only expect to arrive at a rational theory 
if we deal with material particles in the same way as with photons . 
This point of view was developed in the Bose- Einstein treatment 
of an atomic gas, which paralleled that employed in the theory 
of black body radiation (“ light quant gas ”). 6 Schrddinger's 
researches took as their point of departure the Hamiltonian 
theory of mechanics, which was originally obtained by Hamilton 
himself from an analogy with geometrical optics. He argued 
that since we replace geometrical optics, with the aid of which 
interference and diffraction cannot be treated, by wave optics, 
it is reasonable to attempt the analogous transition in mechanics. 
The results amply justified the attempt. The investigations of 
Davisson and Germer , which prove the existence of interference 
in beams of electrons reflected from a crystal lattice, were already 
in progress when de Broglie published his theory. The experi- 
mental evidence that moving material particles behave in much 
the same way as a beam of light with respect to these phenomena 
is now fully established, and with no less certainty than for 
X-rays, by a series of further investigations by the same 
authors and by G . P. Thomson , F . Rupp and others. 7 The 
real difference between “ light-like ” and “ electron-like " beams 
lies in the fact that the particles composing the latter possess 
charge and proper mass and can consequently be deflected by 
electric and magnetic fields. 

A simple oscillation is one in which the function i/j } defining 
the state of the system, depends on the time in accordance with 
the law 

xfj(t) = a • e~ ivt (2.2) 


where a and v are independent of t . [We choose as our unit 
of angular measure that one which proves most useful in differ- 
ential calculus, for it yields the simple relation 


1 de(x) 
i dx 


= «(*) 


(2.3) 


for the fundamental trigonometric function e ix = e(x). The 
sum of the angles about a point is then 2n ; it would, admittedly, 
be more correct from the integral standpoint to take this as 1, 
but then the factor 2n would appear in the differential relation. 
v/ 27 r is the number of oscillations in unit time ; we shall not 

* Just as in the classical wave theory we have an expression for the flow 
of energy in addition to its density, so in the more refined formulation of 
quantum theory we will have an expression for the probability that the 
photon passes through a given element of surface (“ probability current") in 
addition to one for the probability that it be found in a given element of 
volume ("probability density"). 
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hesitate, however, to use the name “ frequency ” for v. If we 
denote Planck's constant of action by 2 nh instead of h } and we shall 
throughout the present work, the fundamental formula (1.1) 
will still be valid in the new nomenclature.] In accordance with 
(2.3) the simple oscillations (2.2) are the characteristic functions 
of the linear Hermitian operator which carries iff over into 

— j -- ; the corresponding characteristic numbers are the 

energies E == hv. If the dependence of a state of the system on 
time is described by a superposition of simple oscillations 

iff{t) = a 1 e^ t + a 2 e~ iv ^ + • • •, (2.4) 

the energy is capable of assuming only one of the values hv 1} 
hv 1} • • *, and we shall take the intensity a r a r = | a r | 2 of the 
oscillation of frequency v r in if) as the relative probability that 
the energy is observed to be hv r . The relation E = hv is accord- 
ingly to be interpreted : if v is indeterminate because an entire 
spectrum of frequencies v is contained in the oscillatory process , then 
the energy is indeterminate to the same extent ; the intensities 
with which the various simple oscillations occur in the process 
measure the probabilities of the corresponding energies. The 

operator — ^ ~ represents the energy : 

i at 

in the following sense : a characteristic function of (2.5) represents 
a state in which the energy assumes a definite value E with certainty. 

This value is the corresponding characteristic number ; in an 
arbitrary state the components a of $ with respect to these character- 
istic functions determine the relative probabilities a a of these 
values E . 

According to the theory of relativity energy is to be con- 
sidered as the time component of a 4-vector whose spatial com- 
ponents constitute the linear momentum p = (p Xi p Vj p z ). The 
fundamental metric invariant of the two vectors running from 
the origin to the points ( t } xyz ), (t' f x'y'z') is the scalar product 

cHt' — (xx f + yy' + zz'). 

Under a Lorentz transformation, which transforms from one 
space-time co-ordinate system to another equally permissible 
one, the quantities 

cH, — x, — y, — z 

must consequently transform contragrediently to t, xyz-; they 
are therefore the components of the vector associated with 

R9Q 
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(£, # y z) in the space which is the dual of the 4-dimensional space- 
time world. Such a dual vector is given by 


H ) Px > pv) Pz ) 

or, what amounts to the same thing, 

Hdt — {p x dx + p y dy + p^z) 

is invariant under Lorentz transformations. The same is true 
of the total differential operator 

d = \-dt + (~dx + ~dy + ^-dz) 

7)t Xbx 7>y ' t*z I 

applied to an arbitrary function of t ; x , y, z . Hence the corre- 
spondence (2.5) necessarily implies the further relations 


Pz 


h t) 
i 7>x } 


Pv 


h h b 

i by’ b? 


(2.6) 


which are to be given the analogous interpretation. 
A homogeneous plane wave 


i/i = a • *<(-•<+«* + »+**) ( 2 . 7 ) 

is simultaneously a characteristic function of the four mutually 
commutative operators (2.5), (2.6), which has as characteristic 
numbers 

H — hv, p. x — ha, p v = /i0, p , = hy. (2.8) 


It represents a state in which the energy and linear momentum 
of the quantum possess these sharply defined values. 

In classical mechanics the laws governing the motion of a 
particle are known as soon as we express its energy H in terms 
of the “canonical variables" xyz, p x p v p t . In Newtonian 
mechanics the Hamiltonian function for a free material particle 
of mass m is 


rr Px Py Pt . 

2m 


(2.9) 


on employing the transition scheme developed above we obtain 
the corresponding wave equation 


h bip 

i bT 




j£, 
bx J ' + by* 



( 2 . 10 ) 


(2.7) is a solution of this equation provided the values (2.8) of 
energy and linear momentum satisfy equation (2.9) ; in this 
sense (2.9) and (2.10) are equivalent. But the equation (2.10) 
is linear and has as its most general solution a linear super- 
position of simple waves (2.7) ; such a superposition corresponds 
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to a state in which the energy and momentum of the particle 
assume their various permissible values “ with a certain definite 
probability.” 

The space vector (a, ]8, y) in (2.7) gives the direction of 
propagation of the plane wave, and the modulus of this vector 
is the wave number fi (the number of waves contained in 2 tt 
units of length; % r//x is the wave length A). Hence by (2.8) 

2rrh 

the absolute value p of the momentum is equal to hfi = -y . 
v 

- is the phase velocity of the wave ; in accordance with (2.9) or 



it is h\xj2m = hTr/Xm and depends on the wave length or frequency 
(dispersion). Since p = mv t where v is the velocity of the 

particle, the “ group velocity ” ~ = ~ = v coincides with the 

velocity of the particle. Experiments on diffraction and inter- 
ference phenomena in electron beams, such as those performed 
by Davisson and Germer , have made it possible to test directly 
these relations set up by de Broglie. 

In relativistic mechanics we have in place of (2.9) an equation 
which states that the square of the absolute value of the energy- 
momentum 4-vector is constant and equal to m 2 c 2 : 
r /2 

^£~{pl + Pl + Pl) = ( 2 . 11 ) 

or 

H = c s/ mh 2 -f- (pi + pi + pi). 


For the transition to a wave equation it is of advantage to employ 
the rational form (2.11) of this expression : 


L 'bH . k I m 2 c 2 . 

-’W-'-W-’-jr-t- 


( 2 . 12 ) 


Here again the group velocity is equal to the velocity v of the 
particle, but the phase velocity is found to be c 2 jv ; the former 
is always less, the latter always more than the velocity of light. 
In order to return from the relativistic to the “ ordinary ” or 
Newtonian mechanics by passing to the limit oo, we must 

( me 2 1\ 

hTj * 

The differential equation governing light waves can be ob- 
tained from (2.11) by dropping the term on the right-hand side. 
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Hence from the corpuscular standpoint light consists of photons 
or particles of proper mass 0 : 

— (pi + Pi + pi) = 0. 

In accordance with the expression (2.1) for the probability 
density, we are to consider as the vector in unitary system-space 
describing the state of the system the function tfi in so far as it 
depends on the spatial co-ordinates xyz. The integral of (2.1) 
with respect to the spatial co-ordinates gives the probability 
that the particles will be found “ within the volume V at time t." 
Space and time must be separated from one another ; the system 
has at each time t a definite state <p[xyz), which will in general 
vary with t. The operators which represent physical quantities 
must accordingly be ones which operate on an arbitrary function 
of the spatial co-ordinates. This requirement is satisfied by 
the operators (2.6) corresponding to the momentum co-ordinates, 
but not by differentiation with respect to time, which we have 
associated with the energy. We must instead consider the 
situation as described as follows : from the expression for the 
energy in terms of the canonical variables p x , p v , p t we obtain 
the operator H which represents the energy and which operates 
on the function ip( xyz ). The equation 

^ + 0 

is then the dynamical law which determines the change in the 
state ip in time. 

The separation of space and time offers certain difficulties 
to the development of quantum theory from the relativistic 
standpoint ; consequently, for the present, we base our develop- 
ment on the Newtonian mechanics. 

Our procedure must eventually be modified in another 
important respect : we have here tacitly assumed, for the sake 
of mathematical simplicity but without physical justification, 
that the wave field of a material particle is described by a scalar 
quantity ip. The modification, which is required in order to 
give an adequate description of the facts of spectroscopy, will 
be made in Chap. IV. 

§ 3. Schrodinger’s Wave Equation. The Harmonic 

Oscillator 

When the particle is moving under the influence of forces 
the kinematic part (2.9) of the energy is augmented by the 
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potential energy, which usually depends on the co-ordinates 
alone and not on the momenta. We must therefore know 
which Hermitian operator acting on p corresponds to the co- 
ordinate x. I assert that it is multiplication by x ; this operator 
is already referred to its principal axes, its characteristic values 
are all real numbers # and finally ifj(; x), or more precisely ip(x)V dx, 
is the component of the “ vector ” associated with the character- 
istic number x (we have here ignored the other co-ordinates y, z). 
In accordance with the statistical interpretation of the relation- 
ship between physical quantities and operators, our assertion is : 

x t 

the probability that ^ has a value between x x and x 2 is ^tfnjtdx] 

this is in agreement with the expression (2.1) for the probability 
density. If V(xyz) is a function of position in the 3-dimensional 
space, e.g. the potential energy, then the physcial quantity V 
is represented by the operator 

p V(xyz) • p, 


for the probability that V lies between V x and V 2 is given by the 
integral 


j* j* j* $ifjdxdydz 


extended over that portion of space in which V x ^ V(xyz) g V t . 

The operators corresponding to x , y, z commute with each 
other, but the operator Q corresponding to x and the operator 
P corresponding to p x do not. In fact 


or 


PQ-QP = j 1 


where the 1 on the right-hand side stands for the operator 
identity: ifj[x) \fj{x). Because of this non-commutative re- 
lation between the operators P and Q, p x cannot assume a definite 
value with certainty ivhen x does , and conversely. In fact, if p x 
is known to have the value Aa with certainty, then the dependence 
of p on x is given by the factor e i<xx ; in consequence of this the 
position x of the particle is entirely indeterminate, since the 
probability tfjifj of localization is the same for all points x. 

If V(x, y, z) is the potential energy of the field in which the 
particle moves, the total energy is 

p* p2 
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We assume with Schrc dinger that in spite of the fact that all 
our variables do not commute we may still apply our rules for the 
formulation of the wave equation ; we thus obtain Schrodinger's 
differential equation 

We understand by “ stationary ” or “ quantum states *’ ift those 
in which the energy E has a definite value ; they are character- 
ized as solutions of the wave equation which satisfy in addition 
the equation [cf. (2,5)] 

i 7>t v 

On setting E = hv ) such a iff will have the form e~ ivi • ifr where 
the new function denoted by iff is independent of t . This function 
ifj(xyz) ) which depends only on the spatial co-ordinates, satisfies 
the reduced equation 

A 2 

W+[E-V(xyz)]+=0. 

The problem is thus reduced to finding values of E and functions 
i/i 4 = 0 of position which satisfy this equation and are such that 
the integral of tpifi over the entire space is finite. They are the 
characteristic numbers and characteristic vectors of the Hermitian 
operator H associated with the energy (3.1) in the function space 
of all functions of position ip. The characteristic numbers E 
are the possible energy levels of the particles. 

Before going any further into the interpretation of the theory 
we have developed, it will be well to convince ourselves that it 
leads to energy levels which are in agreement with the facts. 
The simplest example is that of the linear oscillator ; with it 
we are dealing with only one co-ordinate x. The potential 

energy is V{x) = and the total energy 

" = ■5(1? + “4 M 

The equation for the determination of the characteristic values 
E and the associated characteristic functions tff is 

=3 + (*-S")**-a 


(3.3) 
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Hermitian polynomials . The solutions of this equation are 
expressed in terms of Hermitian polynomials. The n ih Her- 
mitian polynomial r) n {x) is defined by the equation 

-^(^) = (- l) n e-i*'-Vn(x); (3.4) 

it is of n th degree and the highest term is exactly x n . The 
Ini*) {n == 0, I, 2, • * •) constitute an orthogonal set of functions 
with the “ density function ” e~ x *l 2 : 

+ 00 

Je - * 3 / 2 rj n [x)r} m (x)dx = 0, m 4= n ; (3.5) 

- QO 

the functions 

<f>n{%) = e-^l* • !)«(*) 

are consequently orthogonal in the ordinary sense. To prove 
this we need merely to note that 

+ 00 

(- l ) n \j^n( e ~ Xt ' 2 ) • Vm(x)dx 
- 00 

becomes, on integrating n times by parts, 

. ZsMi, 

J dx n 

— CO 

and the integrand vanishes for m <n. For m = ft we obtain 

+ 00 

n ! ^e" xt I 2 dx 
- 00 

so the equations (3.5) can be supplemented by 
+ 00 

y\ = ^e~ x * l2 r]n(x)dx = n! \/27r. 

— 00 

From (3.4) we have 

Wn-H x 

,«(*) =-(- l ) n d^l(e~ x ' 12 ) 

or ^(£»)- Since 

(e~ x *l 2 ) 
dx K ' 


and we can consider as either 

dx nhl dx f 


K£) 
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the first of these interpretations yields the recursion formula 

Vn+i(x) = n[x) - nr, n _ i(ar). (3.6) 

From the second we find 


- ' Vn(x)] = • Vn+1 {x) 


or 


W^) = -^ n + ^»W. (3-7) 

On subtracting the recursion formula (3.7) from (3.6) we find 
the simple relation 


drj n 

i = »*-* 


(3.8) 


Differentiating (3.7) and substituting ( n + l)ij n for the derivative 
of ?; n+1 in accordance with (3.8), we obtain the differential equation 


dx* 


~~ x ii + nVn 


= 0 . 


The equation for <f> n (g) is consequently 

^r-T<t>» + ( n + l)<t>n= o. (3.9) 

On going over to a new unit of length by the substitution 
at == oc^, the left-hand side of (3.3) is equal to the left-hand side 
of (3.9) multiplied by /& 2 /2woc 2 provided 

JL.l JL(„ + 1 \ =b 

2m«* 4 2 ’ 2 1 2/ 1 


Let co = Vajm denote the classical frequency of the oscillator. 
The first of these conditions determines the new unit of length a : 

2= h = h 

2 am 2wa>’ 

and the second requires that 

E = E n = ha>{n + |). (3.10) 

It is possible to show that the <f> n (£) constitute a complete ortho- 
gonal system, 8 and consequently there can exist no further 
characteristic numbers and functions. The oscillator possesses 
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the discrete energy levels (3.10) at intervals hay apart. That the 
lowest energy level turns out to be l hay instead of 0 is of itself of 
no significance, as we may always introduce an additive constant 
into the energy, although it is meaningful to assert that the least 
possible value of the quantity H, (3.2), is equal to \hay. 

However, the wave equation not only yields the energy levels 
as characteristic values, but it also gives us information con- 
cerning the probability of localization by means of the character- 
istic functions. For convenience we now take oc = \ tJ— as the 

> 2 mco 

unit of length. When the oscillator is in the state described by 
the n th energy level , the probability that the oscillating particle is 
at a distance x from its position of equilibrium is given by 
e~ x2 I 2 • rjl(x). These probabilities are to be understood as 
relative, and refer to equal infinitesimal intervals about the 
points of comparison #. In particular, for the lowest energy 
level n — 0 the probability density is e~ x *l 2 ; we can therefore 
no longer say that the mass-point is at rest in the position of 
equilibrium, but rather the probability of its displacement from 
this position is given by a Gauss error curve. The normalized 
characteristic functions of (3.3) are given by 

</.„(*) =A <£»(*). 

7« 

On expressing any function p(x) of position in terms of this set 

00 + 00 
~ ZX’/'nM, *n = mxWnitfdx, 

-oo 

and the operator belonging to the energy H is, as we have already 
seen, expressed in terms of these co-ordinates ifj n by 

x n -> hay(n + i) • X n . 

In order to find the operator associated with the co-ordinate x 
we must express xi p n (x) linearly in terms of the characteristic 
functions themselves ; by (3.6) we have 

X<j> n = fn+1 + 1 

whence 

xi/j n = 1 Ipn-l = Vn + 1 \fj n+l + Vn 

7n 7n 

The correspondence ify(%) -> xtf/(x) is thus expressed in terms of 
these Fourier coefficients by 

x n Vn x n _ x + Vn + 1 x n+1 ; 
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its matrix ||? ntn || contains only the elements 

?». n-i = Vn, q n , n+1 = Vn + 1. (3.11) 

(On returning to the original unit of length the right-hand side 
must be multiplied by the factor a.) On applying the operator 

to <f> n we obtain, in accordance with (3.8) and (3.6), 

whence 

^ = UVn tp n _ x — Vn+l if> n+1 ). 

The linear Hermitian correspondence associated with the mo- 
mentum p = j ^ is accordingly 

* B “*■ Vn X „-l + v» + 1 X n+1 ) ; 

its matrix \\p nm \\ has as its only non-vanishing elements those 
for which m = n ± 1 : 

£n, »-l = — 4 V”> Pn,n + 1~ IfiV n+1. (3.12) 

(On returning to the original unit of length these elements are 
to be multiplied by 1/a. — Terms with the index n — 1 are to 
be omitted when n = 0 ; in fact, they automatically drop out 
of the above formulae.) 


§ 4. Spherical Harmonics 

In order to discuss the energy levels of an electron in a 
spherically symmetric electrostatic field we must first discuss 
spherical harmonics and their principal properties. 

1 . Definition . — Let r denote the distance from the origin in 
the 3-dimensional space with co-ordinates x , y , z , and let r, 0, 0 
be polar co-ordinates with polar axis along the positive z 
direction : 

x + iy == r sin 9e 2 = r cos 0. 

On setting a homogeneous polynomial w of 0 th degree in #, y, a 
equal to r* * Y lt Yi depends only on the directional co-ordinates 
0, $ and is a function of position on the unit sphere. If u is 
a harmonic function , i.e. if it satisfies the equation A u — 0, 
Y i is said to be a surface harmonic of degree l and the harmonic 
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function u itself is said to be a spherical ( or solid) harmonic of 
degree L Since in polar co-ordinates 

A It) / 0 ()U\ 1 A 

A “ ■) + f* / 1u ' 

. 1 f7> { . J)U\ , 1 

Au = siwr 1 e w + sre spj < 4 -'> 

the surface harmonic Y x satisfies the differential equation 

AYi + l(l+l)Y % = Q. (4.2) 

2. Orthogonality . — On applying Green’s formula to the 
spherical harmonics u = r k Y k) v = r x Y x on the interior of the 
unit sphere, we obtain the orthogonality relations 

J 7* 7,^ = 0, &=M, (4.3) 

in which doo = sin dddd<f> is the surface element on the unit sphere. 
Since the conjugate complex Y k of a surface harmonic is also a 

surface harmonic, the first factor in (4.3) can be replaced by Y k . 

3. Basis . — On writing 

f = * + iy, rj == x iy 


the differential equation Au = 0 becomes 


A u s 


<>77 


+ 


(> 2 U 

c)2 2 



we see that a homogeneous polynomial u of degree l in 77, 2 
breaks up into harmonic polynomials u^ m ) : 

u = Eul m \ (m = — l, - • *, l — 1 , l) 

where u (tw) consists of all terms in which the exponents of £ and 
k] have the fixed difference m. The recursion formula for the 
coefficients of which is obtained from the differential 

equation A u = 0, further shows that there exists one , and to 
within a multiplicative constant only one , such harmonic u (rn) . 
Accordingly, there exist exactly 2/ -f 1 linearly, independent 
surface harmonics of degree l ; we may take them to be the 
Y ^ defined by 

= Y" • Y^p. 

Writing 

u( m) = (x — iy)~ m • P = (x + iy) m * P* 


and r placing 


(x + iy)(x — iy) by r 2 — z 2 , 
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P and P* depend only on r 2 and 0 . Hence on taking r = 1 
we have 

y(in) = e im* ( s j n 0)-» . p(m) ( cos (4.4) 

For m = — l we take P = 1, and for m = + Z, P* = 1 ; 
P(js) = (1 — 3 2 )* for this latter case. Since depends on 

<f> only in the factor e im * 

J V^Y^dco = 0, m' 4= m. (4.5) 

This basis Y (l f\ in which the 0 -axis occupies a preferred position, 
is accordingly unitary-orthogonal. 

4. Completeness. — That the totality of surface harmonics 
constitute a complete orthogonal system on the unit sphere can 
be proved by showing that any polynomial in x , y , z on the 
sphere can be written as a sum of surface harmonics. Now 
the general polynomial of degree l contains 

(l + 1) + l + (J — 1) + ' • • + 1 

arbitrary constants. But exactly this same number of linearly 
independent homogeneous polynomials are contained in the 
expression 

r\Y l + Y + • • •)[= u i + (* 2 + y 2 + * 2 K-« +•••]> (*•«) 

for the polynomials of the form r l Y h r l Yi_ 2) • • • are linearly 
independent in virtue of the orthogonality of surface harmonics. 
r l Y i contains exactly 21 + 1 = (Z + 1) + Z linearly independent 
functions, and consequently (4.6) contains exactly 

((/ + 1) + /] + [(/ — 1) + rt — 2)] + • • •, 

as asserted above. 

5. Closed expressions for the surface harmonics. — On sub- 
stituting (4.4) in (4.2) we obtain the differential equation 

(1 - z»)^ t + 2 (m - 1 )z d f z + [/(/ + 1) - m(m - 1)] • P = 0 


for the polynomial P = Pty in 0 = cos 6 . From this equation 
we find that satisfies the same differential equation on re- 
placing m by m — 1 ; we thus obtain the recursion formula 


P<?>(s) = 


d l ~ n 

dz l ~ m 


.(1 


and the expression 
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In particular, the “ zonal harmonic ” 


Pi(*) 


- P ( f(z) = 


d l (l — z 2 ) 1 
dz l ' 


6. Further formula . — 

\xY lc Y l doj = 0 (4.7) 

unless l — k = ± 1. For x*r k Y k is a polynomial of degree 

k + 1 and may, in accordance with 4, be expanded in the form 

r fc+1 (7 fc+1 -f- Yjc~i +•*’)• Consequently on the unit sphere 

*Y h = Y m + 7,., + * • - (4.8) 

and the only values of l ^ k for which the integral (4.7) can 
have a value other than 0 is l = k + 1. Hence our assertion 
(4.7) ; it also follows from the above that only the first two 
terms can appear in (4.8). 

Further, we shall also have occasion to use the differential 
expressions 

l ’ u • L - u ' (4 ' 9) 

L 2 u = L x (L x u) -(- Ly[L v u) L z (L x u) 


in terms of polar co-ordinates. On setting in 
du 


7m , .7m j . 7m , 

—dx + —dy + 
lx ly 1st 


the changes dx, dy , dz obtained by allowing <f> to increase by 
d<j> and holding r, 6 fixed, we obtain immediately 

1 lu 
i l(f> 

Similarly, 

(l , . cos e 7> 


Lm 


L x -f- iL v 


\ld % sin 8 l(f>. 


)■ 


r • r -ilhf 7) . .COS 0 l \ 

L * ~ e ( id + 1 sin 9 
- A [eq. (4.1)]. 


U 


(4.10) 


(4.10) 


§ 5. Electron in Spherically Symmetric Field. 
Directional Quantization 

Now back to physics ! Consider an electron of charge — e 
revolving about a fixed nucleus of charge Ze situated at the 
origin. For Z = 1 we have the hydrogen atom, for Z = 2 
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singly ionized helium He + , for Z = 3 doubly ionized lithium 

Ze 2 

Li ++ , etc. The potential energy is V = — ; we shall, 

however, for the present take V (r) more generally as any function 
of the radius r. The wave equation for the determination of 
the energy levels is then 

^At + [E-V(r)W = 0. (5.1) 


On expanding in terms of surface harmonics t/j becomes a sum 
of terms fi(r)Y t (l = 0, 1, 2, • • •). The differential operator 
on the left-hand side of (5.1) sends the Z 01 term of this sum into 
Y x times 

{6 - 2) 

Consequently each individual term must satisfy the differential 
equation separately ; we thus obtain a complete set of char- 
acteristic functions of the form 

The factor f t (r) depending only on r must be such that (5.2) 
vanishes and converges. Denoting the char- 

acteristic numbers and characteristic functions of this differ- 
ential equation by 

Em, fm[r) (» = 0 , 1 , 2 , • • ■), 

E n i is a (2/ + l)-fold energy level, as the expression f nt (r) Y x 
contains 2Z -f- 1 linearly independent characteristic functions 
associated with this single characteristic value ; we may choose 
as a basis the functions 

W = Ur)'Y^ (m — — l,* • •, Z — 1, /). 

We thus arrive at three integral quantum numbers : the 
“ radial quantum number ” n, the “ azimuthal quantum number ” Z, 
and the “ magnetic quantum number ” m. The energy level 
depends only on the first two. 

In justification of this nomenclature we determine the angular 
momentum h2 of the electron with components 
hL x = yp M — zp v , • • •. 

In quantum mechanics L x , L Vi L z are the operators (4.9). 
Hence for 

f nl (r) Y ( 7^ = • (a function of r and 0) (5.3) 
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we have, in accordance with (4.10), 

L z ip — m • p, 

and for the general characteristic function 

P = ,fnl( r ) Y I (5.4) 


with azimuthal quantum number l 

L 2 i/j = l (l + 1 ) 


Hence in the state described by (5.4) not only the energy has 
a definite value E nh but also the absolute value of the moment 
of momentum 


£ 2 - l{l + 1 ) 


(5*5) 


The significance of the azimuthal number is that it fixes this 
magnitude. It is indeed remarkable that there exist states 
l = 0, n = 0, 1, 2, • • • with spherically symmetric character- 
istic functions i/j == f n0 {r) for which the moment of momentum 
vanishes. In the states described by (5.3) not only the energy 
and the absolute value of the moment of momentum have 
definite values, but also the z-component of the moment of mo- 
mentum assumes a definite value with certainty , for then 



(5.6) 


Since a magnetic dipole moment 


8 = 



(5.7) 


is associated with the angular momentum AS of the revolving 
electron (the mass of the electron being denoted by fi whenever 
there is danger of confusion with the magnetic quantum number 
m), the influence of S will be felt on subjecting the atom to a 
magnetic field. The existence of the Zeeman effect under such 
conditions can be traced to this cause. A fundamental ex- 
periment to observe the magnetic moment of the electron directly 
is due to Stern and Gerlach. Let a stream of one-electron atoms, 
which are all moving in the direction of the #-axis and are in 
the state (n, l) with energy level E nh be subjected to an in- 
homogeneous magnetic field in the direction of the 2 -axis. Let 
' the x- and y-components of the magnetic field vanish in the 
(x-z) -plane, in which the beam moves, and let the 2 -component 
be a function of 2 alone. A magnetic dipole, the 2 -component 


of whose moment is s z , is then acted upon by a force -j- • s z 
in the positive 2 -direction. In consequence of (5.6) the atomic 
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beam should be broken up into 21+1 smaller beams by the 
force in the 2 -direction, corresponding to the various values 
m = l, l — 1, • • •, — l of the magnetic quantum number. 
On performing the experiment on silver atoms in the normal 
state two beams, corresponding to m — ± 1, were observed ; 
the value of the “ Bohr magneton," the elementary magnetic 
moment corresponding to one unit of angular momentum, was 

sh 

found to agree with the value ^ obtained from (5.6) and (5.7). 

Why the unperturbed beam corresponding to m = 0 did not 
appear remained unexplained. 

The older quantum theory, which employed the quantum 
number k = l + 1 with values 1, 2, • • •, allowed m to assume 
the integral values from — k to + k ; it seemed plausible to 
exclude the case k = 0, although one was thereby led into 
difficulties on applying the so-called 4 ‘ adiabatic hypothesis ” 
to the behaviour of an atom under the influence of crossed 
electric and magnetic fields. In the new quantum theory no 
ad hoc hypothesis is required for this exclusion, as l can assume 
only the values 0, 1, 2, • • \ But according to either the old 
or the present scalar wave theory there should exist an odd 
number of permissible values of m for given fe or l; the exclusion 
of the case m — 0 apparently required by the Stern-Gerlach 
experiment cannot be accounted for on either theory. Nor 
can we explain the related fact that in the anomalous Zeeman 
effect m may assume either an even or an odd number of values, 
according to the nature of the atom under consideration. 
Obviously something is lacking in our present scalar wave 
theory as well as in the older formulation ; we return to this 
point again in Chap. IV, § 4. The older quantum theory 
described the situation met above as “ directional quantiza- 
tion 99 ; since the absolute value of the moment of momentum 
was hk and the component along the 0 -axis was hm ) it concluded 
that the magnetic axis of the atom could assume only positions 
described by the inclination 0 with the 0 -axis determined by 
the formula 

. cos 0 = J (m = 0, ± 1, ± 2, • . ± k). 

Thus in the case k = 1 we should expect only three possible 
orientations for the magnetic axis : parallel and anti-parallel 
to the field, which we have taken in the direction of the 0 -axis, 
and perpendicular thereto — unless we empirically exclude this 
latter possibility m = 0 because of the Stern-Gerlach experiment, 
in which case we have but two. In either case we find ourselves 
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faced with a serious dilemma, for the direction of the 0 -axis is 
an arbitrary direction in space. In order to avoid this one 
then assumed that the quantization was due to the influence of 
the magnetic field, and consequently the preferred 0 -direction 
was interpreted physically as the direction of the magnetic field. 
But even so the difficulty is not avoided in the limiting case of 
vanishing magnetic field, for the directional quantization should 
be maintained in arbitrarily weak fields. Or stated more 
physically, the radiation mechanism required by the Stern- 
Gerlach effect for the orientation of the atoms, which were 
originally in random orientation and processing about the 
0 -axis, requires about 10 8 times as long as the greatest time 
consistent with the observations. The stand taken by the new 
quantum theory on this point is fundamentally different. The 
possible states (n, l ) of the atom are described by the functions 
iff of the (21 + l)-dimensional linear family 

4‘=fni{r)Y l =2x m -fJr)Y^ 

m=*—l 

or by the vectors of a (21 + l)-dimensional space with com- 
ponents x m . The z-component of the moment of momentum , as 
well as the component in any arbitrary direction , is capable of 
assuming only the discrete values hm (m = l, l — 1, • • *, — l). 
But in a state in which the 0 component, for example, assumes 
the value hm with certainty there is only a certain probability 
that any other component will assume a definite one of its 
possible values h* 0, h * (± 1), • * *, h * (± l). The name 
“ directional quantization ” is hardly an appropriate description 
of this situation. 9 

When the electro-static central force satisfies the Coulomb law 
and originates in a nucleus of charge + Ze } the differential 
equation (5.2) for the “ radial characteristic function ”/ = f nl (r) 
becomes 

®-^/) +£(*■+**>/=<». 


The character of this equation is unchanged on going over to 
the new dependent variable v defined by rf = e~ <xr • v : 


d*v 
dr 2 


~ dv 
2 *dr + 



2 mE\ 2 mZe 2 

HFJ + 


Uf J r 1 ) 


} v 


We choose a in such a way that the constant term in the co- 
efficient of v vanishes : 


h 2 ct 2 = — 2 mE. 


(5.8) 
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We know from the general theory of linear differential equations 10 
that there exist solutions of this equation in the neighbourhood 
of the (regular) singular point r = 0 in the form of a power 
series 

v = £ a ^ 

in which the exponent fi begins with a certain value jjl 0) which 
need not be an integer, and runs through the values m + 1, 
^ -j- 2 , • • \ On substituting this power series into the equa- 
tion we find the recursion formula . 

+ 1) — Z(Z + l)}dWi = J (5.9) 


for the coefficients In order that it be satisfied for /x + 1 =/u 0 
fa = 0, a M+1 4= 0) we must have 

Mofoo’-l ) = 1 ( 1 + 1 ). 

We thus have the two possibilities : 

/*o = Z + 1 or /x 0 = Z. 


Considering the first possibility and taking the coefficient a l+1 
of the lowest power as unity, all remaining coefficients can be 
obtained by successive applications of the recursion formula 
(5.9), as the denominator /x(/x +1) — Z(Z + 1) never vanishes ; 
let the solution thus obtained be denoted by v. The second 
possibility does not lead to a solution, however, as the denomi- 
nator in the recursion formula for /x = Z vanishes ; the second 
solution of the differential equation can be obtained by quad- 
rature from the first and involves logarithmic terms. 

The power series for v breaks off if for a definite exponent 
P = Mo + n 

Zme 2 
or 


u _ Zme 2 

T+Ty 

In this case / is of the form 

e~" • r l • (polynomial of degree n in r ) ; 
it is finite at r = 0 and the integral 


( 5 . 10 ) 


f r'f{rW)dr 


( 5 . 11 ) 
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exists, as is to be required. The corresponding characteristic 
numbers E are the energy levels ; on writing n in place of 
n + 1+1 and solving (5.8), (5.9) for E we find 


Z 2 me 4 1 
2A 2 n r 


(5.12) 


The integer n, the principal or total quantum number, is 
subject to the condition n > 1. There exist no other solutions 
for which the integral (5.11) converges. 8 

The energy levels depend only on the principal quantum 
number n ; the terms for which it is a fixed number and 

^ 0, 1, • • •, ra — 1 coincide in a single degenerate term E„ 

of multiplicity ” 

E (2/ + 1) = n s . 

1=0 


This theoretical result agrees with the empirical formula for the 
Balmer, Paschen, Lyman, etc., series. We find, in fact the 
expression ’ 


Z*R 
n* ’ 


R _ mef 
~ +rh»c 


for the terms measured in wave-numbers (— = — The 

\2ttc 2nck/' 

expression for the Rydberg constant R in terms of the fundamental 
constants of nature [the charge and the mass of the electron, the 
velocity of light and the elementary quantum of action ) agrees 
numerically with its empirical value. All terms and therefore 
all actual line frequencies v depend on the integer Z describing 
the charge on the nucleus in such a way that Vv increases in 
proportion with Z. Since the X-ray terms are due to the inner- 
most electrons, which are but slightly affected by the outer 
ones, we should expect to find that the hardest X-ray lines, 
arranged in accordance with the atomic number Z, follow this 
law. It was discovered by Moseley and gave a conclusive proof 
of the fact that on going through the elements of the periodic table 
the charge on the nucleus increases by e from element to element. 
This . law uncovers with unerring certainty the holes yet re- 
maining in the system of known elements ; at present we lack 
but 2 (or 3) elements in the series beginning with hydrogen, 
Z = 1, and ending with uranium, Z = 92. 

The characteristic functions associated with these energy 
levels, which determine the relative probabilities of the various 
positions of the electron, can be expressed in closed form in 
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terms of the so-called Laguerre polynomials. The character- 
istic function belonging to the normal state n = 1, l = 0, is 
spherically symmetric : * 


for hydrogen 


iff = — L= • e~ r l a ; 

V no? 

(5.13) 

a = 5 = 0-532 A 

me 2 

(5.14) 


(According to the older Bohr theory, a is the radius of the inner- 
most electronic orbit.) a determines the order of magnitude 
of atomic dimensions. In the normal state hydrogen possesses 
spherical symmetry (according to the scalar wave theory — but 
see Chap. IV, § 8). 

The radial characteristic functions r m f ni (r) do not, however, 
constitute a complete orthogonal system for a given l for the 
full domain which we wish to consider : in addition to the 
discrete term spectrum (5.12) we have the continuous spectrum 
covering the whole region E ^ 0. We go no further into this 
matter. 11 


§ 6. Collision Phenomena 

The optical phenomena show that the quantum theory leads 
to the correct energy levels, but they do not lend themselves 
to an attempt to interpret the vector iff in system space as a 
probability. Collision phenomena, which deal with the de- 
flection of electrons or a-particles under the influence of other 
material bodies, are best suited for this latter purpose. The 
fundamental experiments of Franck and Hertz , as well as those 
of Davisson and Germer } belong to this latter category. 

Neglecting the reaction of the moving particle on the per- 
turbing body, the potential energy due to this latter may be 
taken as a given function V(xyz) of position. Considering 
a one-dimensional problem, the energy of the moving particle is 
then 

H-if+FW. 

We can think of the curve y = V[x) as the contour of a hill 
against which the particle runs. The wave equation for a 


•The normalizing factor ijjira* is calculated from 

00 

I J ^ e ~ Zr l a d x dydz = 4?r = 7 ra % . 
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h?_ djty 
2m dx 2 


+ [E - V(x)] 1 ^= 0 . 


71 

( 6 . 1 ) 


If we neglect for the moment the perturbing field V we obtain 
as solutions of (6.1) the familiar de Broglie waves : ip is a linear 
combination of the waves e ixx and e~ ixx proceeding in the positive 
and negative directions along the x-axis, the wave number a 
of which is determined by 

(hoc) 2 = 2 mE or ha = p. 

Writing F 

~V{x) = U(x) 

equation (6.1) becomes 

^ + [a 3 - U(*)] ip = 0. (6.2) 


We now assume that as x -> ± oo, t/(#) behaves in such a wav 
+ 00 ^ 

that the integral J|f/(x)|<&t converges; equation (6.2) then has 
— 00 

one solution which behaves for x + oo asymptotically like 
e '“ x > and another, which is linearly independent of the first, 
which behaves like e~ ,xx in the same region. 

This can most readily be seen by solving (6.2) by the method 
of successive approximations. Let 

P = Po + Pi + p 2 + • • ■ ( 6 . 3 ) 

and take as the 0 th approximation the function e ixx ; in general 
pn+i is determined in terms of p„ by integrating the equation 

^r 1 + “* Pn + l = U(x)p n . 

Hence 


Pn+i(x) — — -Jsina(* — £) • [7(f) >p n (p) dp. (6.4) 

X 

We restrict ourselves for the moment to a region x ^ x 0 such 
that 


l -\\U(x)\dx = g<l. 

x 0 
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If | tfr n (x) | for all x, the integral (6.4) converges and we have 

00 

X 

we can therefore take a 0 = 1, a n+1 = ga‘ n . Then a n = g n or 

for x^x 0 . 

Consequently the series for ip converges at Least as fast as the 
geometric series with ratio g. It satisfies the integral equation 

00 

ft*) - M*) = - y*in«(* - fl * m m d£ (6.5) 

X 

and is consequently a solution of (6.2). Since 

W*)l^i +£ + *?* + • • * = 

(6.5) leads to the estimate 

00 

X 

from which it follows that ip(x) behaves asymptotically for 
x -> + oo like ip Q (x) = e i<xx . Not only is ~ ip Q) but also 

itc ~ *^bc’ ^° r e( l uat ^ on 

00 

& - if = - J ““<* - « ■ u ©«)<« 

X 

gives as an upper bound for the absolute value of the difference 
on the left-hand side the quantity 

00 

rh-fTOl# 

X 

which approaches 0 as x -* + oo. 

The solution ip(x) which we have found in the region x ^ x 0 
can naturally be extended over the entire real axis by analytic 
continuation. Since our considerations apply just as well for 
#-> — oo, we know that \p(x) satisfies an asymptotic equation 
of the form 

ip(x) be 101 * -f- b'e~'* x for x-+ — oo. 
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At the same time we must also have 

ia(be iax — b f e~ % * x ). 

p(x) being a solution of the differential equation, fi(x) is 
also : 

g + [a* - U{x)}4> = 0, d ^ + [ ^-Um=0. 

Multiply the first equation by the second by ip and subtract ; 
we find 

or 

^S“^K = const (6 - 6) 

The determinant (6.6) has the limiting value 2i<x for # -> + oo 
and for # oo 

2ta(W - &'*'), 

whence 

bb~b'b'~ 1. (6.7) 

It follows from this that b =(= 0. On multiplying iff(x) by 1 jb 
we have a solution ip whose asymptotic behaviour is described 
by the equations 

\p(x) r^j e ioiX + a'e~ iexx for # — oo, 

ifj(x) r^j ae ioiX for # + oo (6.8) 

where a = 1/6, a' = b'jb, (6.7) is now 

|a| 2 + |a'| 2 = 1. (6.9) 

A particle of definite energy runs against the potential energy 
hill from the left , i.e. from x^= — oo. Whereas in classical 
mechanics the particle certainly either gets over the hill or is thrown 
back , according to whether its initial kinetic energy is greater or 
less than the maximum of V{x) ) quantum mechanics states that 
there is a probability \a\ 2 that it gets over and a probability |a'| 2 
that it is thrown hack . Furthermore, these probabilities are 
continuous functions of the energy of the particle ; the dis- 
continuity of the classical theory is completely broken down. 
If we perform the experiment successively with a large number 
of particles we find that they are divided into two streams, 
in accordance with (6.8.), proceeding in the positive and negative 
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directions along the a>axis ; the relative intensities of these 
are given by 1 and |a'| a for x-+ — oo, respectively, while for 
x -j. oo there exists only the positive stream of intensity 
|a| s . Equation (7.5) thus expresses the conservation of the 
number of particles and shows that we must consider the square 
|a| a of the absolute value of the amplitude a as a relative intensity 
or probability: 

If the integral 

+ «0 

\ - j I U(x)\dx < 1 

- 00 

the solution </< is represented throughout the whole space by the 
formula (6.3). In perturbation theory one is usually satisfied 
with the first term The theory of the familiar experiments 
of Rutherford, in which a-particles are allowed to fly in a given 
direction with given momentum into and be deflected by the 
field of an atom, has been developed by Wentzel in a similar 
manner. 18 The influence of the oc- particle on the atom is thereby 
neglected ; on taking it into account we are led to the theory 
of the experiments of Franck and Hertz, giving formulae for 
the dispersed particles specified according to their various 
discrete kinetic energies and their various directions. This 
calculation has been carried through for hydrogen by Born and 
Elsasser , 13 A very important application of this picture of 
corpuscular waves “ seeping ” through a potential hill has been 
made by G. Gamow and R. W. Gurney and E. U. Condon to 
explain radioactive decay. 14 

§ 7. The Conceptual Structure of Quantum Mechanics 

The fruitfulness of the theory has been amply established by 
the above applications and the examples given have served to 
illustrate its physical interpretation ; it now seems time to set 
forth its general abstract formulation. 

Consider a physical system of known constitution. Each 
particular state, each individual case of such a system is repre- 
sented by a vector j of modulus 1 in a unitary system space. Each 
physical quantity associated with the system is represented by an 
Hermitian form in this space. The fundamental question which 
we put to the theory is not, as in classical physics, “ What value 
has this physical quantity in this particular case ? ” but rather 
“ What are the possible values of the physical quantity A, and what 
is the probability that it assumes a definite one of these values in 
a given case ? " The answer to this question is : The probability 
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that A assumes the value a is the value E«(j) of the characteristic 
jorm ha. oj A associated, with the value a, where the vector x repre- 
sents the case in question and the quantity A is represented by 
the Hermitian form A in the system space. The quantity repre- 
sented by A is capable of assuming only those values a which 
are characteristic values of the form A. In accordance with the 
equations 

S 2 = EE* (x), A{ 1 ) = 2> Ea(l) 

* <x 

the sum of the probabilities is 1 and the value A{%) of the form 
A is the mean value or expectation of the quantity A in the state r. 
Since all assertions concerning the probabilities in a given state 
X are numerically unaltered when x is replaced by e x, where s 
is an arbitrary complex number of modulus 1, we cannot dis- 
tinguish between these two cases. The pure case or state is 
consequently more properly represented by the ray x than by 
the vector x, and we must therefore operate in the ray field in 
system space rather than in the vector field. 

The significance of probabilities for experimental science is 
that they determine the relative frequency of occurrence in a 
senes 0 / repeated observatio?is. According to classical physics it 
is in principle possible to create conditions under which every 
quantity associated with a given physical system assumes an 
arbitrarily sharply defined value which is exactly reproducible 
whenever these conditions are the same. Quantum physics 
denies this possibility. We illustrate this by the example of 
directional quantization. We know conditions under which we 
can guarantee with practical certainty that the atoms of a 
hydrogen gas are in the normal state. Let us therefore assume 
that we can create conditions under which we can be certain 
that the atoms under observation are in the quantum state (n, l) 
with azimuthal quantum number l — 1 and energy E. ’ A 
certain quantity L z , which can, under these conditions, assume 
only the values + 1, 0, or — 1 is associated with each direction 
z in space. Stern and Gerlach have shown us how to sharpen 
these conditions so that L t takes on a definite one of these values, 
say L„ = + 1. According to the theory the utmost limit of 
precision is then reached. If x is another direction in space, 
then under these conditions which determine L z and E only the 
relative probability that the quantity L x assumes any one of the 
values -f 1, 0, — 1 can be given. Why is it impossible to go 
further and insure conditions under which in addition L x takes 
on a definite one of the values, say 0, with certainty ? Because 
the “ measurement ” of L x , which is accomplished by separating 
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the atoms into three classes L x = + 1, 0, — 1, is only possible 
by creating conditions which destroy the homogeneity already 
existing with respect to L z . Polarization of photons is obviously 
somewhat analogous to directional quantization of atoms. The 
conditions for the production of a monochromatic beam of light 
in a definite direction determine the energy and momentum of 
the photons. To each orientation s of a Nicol prism corre- 
sponds a definite quantity X 8 which is capable of assuming only 
the values ± 1 ; if X 3 = + 1 the light goes through and if 
A* = — 1 it does not. With the aid of such a prism we separate 
out the photons for which A s = 1 without disturbing their 
energy and momentum. The utmost limit of precision is then 
reached ; a monochromatic pencil of polarized light is the most 
homogeneous light possible. If we now place a second Nicol 
of orientation a in the path of this beam, then naturally only 
those photons which have A* = + 1 can pass through. But 
the light which we thus obtain is of the same constitution as 
if the first Nicol of orientation $ were not used at all ; the con- 
dition that all the photons have A, = + 1 is obviously destroyed 
by the second Nicol. 

Natural science is of a constructive character. The concepts 
with which it deals are not qualities or attributes which can 
be obtained from the objective world by direct cognition. They 
can only be determined by an indirect methodology, by observing 
their reaction with other bodies, and their implicit definition is 
consequently conditioned by definite laws of nature governing 
reactions. 15 Consider, for example, the introduction of the 
Galilean concept of mass, which essentially amounts to the 
following indirect definition : “ Every body possesses a mo- 
mentum, that is, a vector wk) having the same direction as its 
velocity h ; the scalar factor m is called its mass. The mo- 
mentum of a closed system is conserved, that is, the sum of the 
momenta of a number of reacting bodies is the same before 
the reaction as after it.” On applying this law to the observed 
collision phenomena data are obtainable which allow a deter- 
mination of the relative masses of the various bodies. But 
scientists have long held the opinion that such constructive 
concepts were nevertheless intrinsic attributes of the “ Ding an 
sick” even when the manipulations necessary for their deter- 
mination were not carried out. In quantum theory we are con - 
fronted with a fundamental limitation to this metaphysical stand - 
j point. 19 

We have already seen, toward the beginning of this chapter, 
that a co-ordinate x and its associated momentum p stand in 
a peculiar relationship to one another : the precise determina- 
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tion of either one of these quantities precludes the precise 
determination of the other. In the state represented by the 
+ 0 ° 

wave function «,) [ J f + dx = j] the mean values ^ _ <i(> ^ 
-00 

P o = (P) are given by 

+°° +® 

J* f(x) <p(x)dx and 

-CO _oo 

No loss of generality is incurred by taking these mean values 
as zero , the first can be made to vanish by replacing x by 
x — x 0 or tfj(x) by i/j(x + x 0 ) and the second by replacing if/(x) 

e ( ~ Ji) ' The mean values (Ax) 2 , (A p) 2 of (x — x 0 ) 2 , 

(P — To) 2 are then given by 
+00 

(Ax) 2 = jx 2 ^(x)^(x)dx, 

— 00 

+ C 0 - j ~ 00 

“00 — 00 

From these expressions the general inequality 

Ap • Ax Si \h 

can readily be obtained (I am indebted to W. Pauli for this 
remark) ; the less the uncertainty in x, the greater the un- 
certainty in p, and conversely.* 

In general the conditions under which an experiment is 
performed will not even guarantee that all the individuals con- 
stituting the system under observation are in the same “ state," 
as represented in the quantum theory by a ray in system space. 
This is, for example, the case when we only take care that all 
the atoms are in the quantum state (re, l ) without undertaking 
to separate them, with respect to m by means of the Stern- 
Gerlach effect. In order to apply quantum mechanics it is 
therefore necessary to set up a criterion which will enable us to 
determine whether the given conditions are sufficient to insure 
such a “ pure state." We say that the conditions (S' effect 
a greater homogeneity than the conditions 6 if (1) every quantity 
which has a sharp, reproducible value under (S has the same definite 


Cf. Appendix x at the end of the book. 
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value under ©' and if (2) there exists a quantity which is strictly 
determinate under ©' but not under ©. The desired criterion 
is obviously this : The conditions © guarantee a pure slate if it 
is impossible to produce a further increase in homogeneity . (This 
maximum of homogeneity was obtained in classical physics 
only when all quantities associated with the system had definite 
values.) 

In the pure state represented by the vector a = (a t ) y a quan- 
tity Q represented by the Hermitian matrix Q = ||^ ffc || has the 
expectation or mean value 


The numbers 


<Q> = Za k diq ik . 

t, k 


a ik — a i&k 


(7.1) 


are the components of a positive definite Hermitian form A of 
trace 1, i.e. 


I (as) 1 2 = • Z&iXi- 

i i 


(Positive definite is to be understood here in the weakened 
sense Afe) ^ 0.) It is to be noted that <j2> depends linearly 
and homogeneously on the quantity ||; <ft || under consideration : 


Q — tr (AQ). 


(7.2) 


If a statistical aggregate A is created by subjecting a large number 
of individuals of the physical system under observation to the 
conditions £, then the mean value of a physical quantity Q 
will be given by (7.2) where A is a certain positive definite 
Hermitian form of trace 1 which is characteristic for the 
aggregate— even if the conditions © do not guarantee maximum 
homogeneity. The reason for this is that (7.2) is still correct 
if we mix statistical aggregates, each of which does possess 
maximum homogeneity, in any proportions ; any statistical 
case may indeed be considered as a mixture of pure states. 
As J. v. Neumann has remarked, this formula (7.2) can be derived 
from the simple axioms 17 : 

... }' ^ f ar e physical quantities and A a real number, then 

<AP > = x < p >> <P + Q> = <P> + <Q>. 

a. it the quantity Q is capable of assuming only positive 
vaiues (i.e. if the form Q is positive definite), then <fi> ^ 0. 

i. If Q is a pure number, i.e. if it is independent of all 
physical conditions, then < Q > = Q, 

„ n ^ SU , ming n ° fc on Jy that any physical quantity Q is repre- 
y an Hermitian form, but also that conversely any 
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Hermitian form represents some quantity associated with the 
system, it follows from (1) that 

<Q> = Za ki q ik , 

i, k 

where the coefficients a ki are independent of Q . (We shall 
return to this assumption in Chap. IV, § 9.) The matrix 
A = !|a ffc || must be Hermitian since < Q > is always real. On 
bringing A into the normal form £&&& (2) requires for the special 
Hermitian forms of the type Q = Zq^Xi that ^ 0 for 

arbitrary non-negative values q { ; consequently cc { ^ 0 and A 
is positive definite. 

The probability that in the statistical aggregate A the quan- 
tity Q assumes the value k is 

w = tr (AE k ) (7.3) 

where E K is the idempotent form associated with the character- 
istic number k. 

We can also distinguish “ pure states ” among general sta- 
tistical aggregates, “ mixed states,” by the fact that they cannot 
be obtained by mixing two or more different statistical aggregates. 
This corresponds to the theorem that an Hermitian matrix A of 
the form (7.1) is not expressible as the sum B -f- C of two positive 
definite Hermitian forms B and C which are not merely multiples 
of A. This can be readily proved on taking the vector a = (#*•) 
as one of the co-ordinate axes in system space. The positive 
definite Hermitian forms A with unit trace, i.e. the statistical 
aggregates, constitute a convex region 3 in the sense that with 
A and B their centre of mass ” \A + pB (A, fx arbitrary positive 
numbers whose sum is unity) belongs also to 3. A point of © 
which cannot be considered as such a centre of mass of two 
points of © distinct from the point in question is called, following 
Minkowski , an “extreme point." 16 © is the “convex core ” of 

the class 6 of all extreme points, i.e. it is the smallest convex 
domain which includes all the points of We cannot dispense 
with a single extreme point of © ; if we leave out but a single 
point of ® the entire convex core shrinks together. We may 
accordingly characterize the pure states as the “ extremes " among 
all the possible statistical aggregates. 

It is often convenient to dispense with the normalization 
tr A = 1 ; (7.3) then gives the relative rather than the absolute 
probabilities. The simplest statistical aggregate is that one 
characterized by the unit Hermitian form with matrix 1 ; it 
represents total ignorance . In thermo-dynamics the important 
role is played by the canonical aggregate A = e~ H i ke ; H is here 
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the Hermitian form which represents the energy, k the Boltzmann 
constant and the number 9 the temperature. 19 


§ 8. The Dynamical Law. Transition Probabilities 

Having considered the general probability laws of the quantum 
theory, we now turn to the dynamical law governing the change 
in the state £ of a physical system during an interval dt of time. 
The dynamical law states that this change is effected by 

idt 

the infinitesimal unitary operator — • H ) where H is the 

Hermitian form which represents the energy : 




(8.1) 


The peculiar significance of the energy in quantum mechanics 
is due to its appearance in the dynamical law. We also consider 
this law as a fundamental axiom of quantum theory of universal 
validity. For the matrix X : 

*ih = %i$k, 

which characterizes a statistical aggregate of the pure state 
described by the vector £ = (x t ) [cf. eq. (7.2)], we obtain the 
equation 

if-XH-H* (8.2) 

on applying (8.1) and taking into account the fact the H is 
Hermitian. This same equation also governs the change in 
time of a statistical aggregate X for a mixed state. 20 

For the integration of (8.1) it is convenient to choose as our 
co-ordinate system the characteristic vectors of H ; the corre- 
sponding characteristic numbers E n are the energy levels. We 
call this particular system the Heisenberg co-ordinate 
system, as Heisenberg tacitly employed it in his fundamental 
paper on quantum mechanics. This Heisenberg co-ordinate 
system is in general not uniquely determined ; the essential 
point is the decomposition of the system space 91 into the 
characteristic sub-spaces 9t' = $(&), 91" = ^(i?"), • • • as- 
sociated with the various characteristic numbers £', E" \ • * *. 
The states represented by vectors £ in such a characteristic 
space are called quantum or stationary states ; in them the 
energy has a sharply defined value. The cases in which H 
possesses only discrete characteristic numbers include “ con- 
ditionally periodic motion,” the only ones for which the older 
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quantum theory could be formulated. The nomenclature and 
symbolism employed in the following is adapted to discrete char- 
acteristic spectra, but this by no means precludes the possibility 
that the spectrum is entirely or partly continuous. Equation 
(8.1) becomes, on resolving it into components with respect to 
Heisenberg’s co-ordinate system, 


hdx n 
i dt 


+ E n % n = 0 


and has as solution 


X n (t) = x n ■ e-'V (E„ = hv n ). (8.3) 

This is an explicit formulation of the unitary transformation 
? j(<) = U(t)i which the state vector j undergoes in time t. 

Since |* n (f)| 2 is constant, the probabilities for the various energy 
values do not change in the course of time. The finite law 

X(t) = U(t)XU-ft) (8.4) 

for the dependence of the statistical state X(t) on the time t 
is fully equivalent to the differential law (8.2). 

The mean value q = q(t) of the physical quantity represented 
by the fixed Hermitian operator Q : 

q(t) = tr [X(f) • Q] 


can, on taking into account the symmetry properties of the 
trace, be written also in the form 

q(t)=tr[X-Q(t)] 

where 

5(f) = U~\t)QU{t). (8.5) 

Consequently the situation can be described either by con- 
sidering Q as fixed for all time and the statistical state X(t) as 
varying with the time in accordance with the law (8.4) — and 
this is the fundamental stand taken by quantum mechanics — 
or we can take the initial state X as representing the state of 
the system for all time and allow the operator Q(t) representing 
the quantity Q to vary with time in accordance with the law 
(8.5). This latter interpretation lends itself to comparison with 
classical mechanics. (8.5) is equivalent to the differential law 

rf -no -OH, 

for in virtue of (8.2) and (8.6) 

§=«(§• < 0 =*,(*•§)• 


(8.6) 
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In particular , the quantity Q is constant in time , i.e. the P r °^ 
abilities associated with it do not change in course of time , if the 
Hermitian form Q which represents it commutes with H . 

In Heisenberg’s co-ordinate system equation (8.5) becomes 

?«n(*) = Iran * £-%*»-***• (8.T) 

The matrix Q(t) is thus expressed in terms of components per- 
forming simple oscillations with frequencies v m — y n . THe 
corresponding amplitude is 0 mn . On going over from the m tfk 
to the n ih stationary state the system loses an amount h(y m — 
of energy ; if this energy is radiated as light, its frequency 
is given by 

v mn = V m — V n . ( 8 . 8 ) 

Classical mechanics collects together all the transitions from 
a fixed level m to all possible levels n = 1, 2, • • - into a single 
state of motion, the motion of the system in the m ttl quantum 
state, whose harmonic components have the corresponding 
transition frequencies v ml) v m2) • • *. For any quantity A it 
therefore associates a constant amplitude a mn with the transition 
m w. But in classical mechanics (for systems with one degree 
of freedom) we have 

v mn = k • a }(»), k = m — n, 

instead of equation (8.8). On multiplying the two Fourier 
series A , B 

Zab-J** and £b k -j*** 

k h 

we obtain the Fourier series C with coefficients 
Cic = £a r b 8 ( r + s=k ). 

Accordingly classical mechanics associates with the quantity 
C = AB the amplitudes 

c mn = 2Xn, tn-r * b m , (r + S — M — »), (8.9) 

whereas quantum mechanics assigns to it the amplitudes 

Cmn jE&mt ^tn " m-r * n • (8.10) 

The difference between these two results lies 'in the fact that in 
(8.9) both factors a ) b have the first index m in common, whereas 
in (8.10) the first index of b is the same as the last index of 
This is in exact analogy with the difference between the “ classical ** 
and the correct Ritz- Rydberg combination principle . This was 
Heisenberg’s starting-point ; the correct combination principle* 
indicates the pertinent .fact that the rule (8.9) for the multi- 
plication of amplitudes must be replaced by (8.10). Admittedly 
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such multiplication is not commutative, and it collects together 
amplitudes which the older model assigned to different orbits. 

We denote |tf ww | 2 as the intensity of the quantity A in the 
transition m -> n. When multiple energy levels occur ( u de- 
generacy ”) only the sum Z\ a mn\ 2 , extended over all indices 
m for which E m = E and all indices n for which E n = E'\ 
has an invariantive significance ; in such a case this sum is 
taken as the intensity of A in the transition E' E". If A * 
is that portion of A in which 9i( E ) intersects 9t(E") the sum 
defined above is the trace of A*A*. 

Consider an atom with one or more electrons and let t be 
the vector from the nucleus to a representative electron. Then 
q = ex, or in case there is more than one electron the sum 
q = £el, extended over the various electrons, is the electric 
dipole moment of the atom. In classical electrodynamics the 
intensity of the light of frequency v emitted by the atom is calculated 
from the amplitude q(v) of the harmonic components of q with 
the same frequency v in the following manner. f The rate at 
which energy flows through a surface element do at the point P, 
whose distance from the atom at 0 is large compared with the 
wave-length, is given by 




where q 1 is the component of q perpendicular to OP and du> is 
the solid angle subtended at 0 by do. We have further assumed 
that the wave-length under consideration is large compared with 
the radius of the atom. Since each photon of frequency v 
carries with it energy hv, we postulate that this law is to be 
taken over into quantum theory as follows : the probability 
that an atom in state n goes over into state ri in unit time and 
emits a photon of frequency v, whose direction lies within the 
solid angle do, is given by 


I inn 


( 2 . 


277k 3 


da). 


( 8 . 11 ) 


We thus arrive at a definite rule for the calculation of the intensities 
of the lines emitted by the atom. The fact that we can now make 
such a prediction indicates a distinct superiority of the new 
theory over the old. In particular, the transition n-*n' does 
not occur if the corresponding coefficient in the Hermitian form 

t By this we mean that the terms c\(v)e iPt -f q(v)e~ iPt occur in the harmonic 
analysis of q. 
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for q is zero. This constitutes the general selection rule. The 
connection between the state of polarization of the emitted, 
light and the direction of oscillation of the electric moment is 
also carried over into quantum theory. But a real derivation, 
of our intensity rule can naturally only be obtained by con- 
sidering the question of interaction between the atom and the 
ether ; see § 13. 


Examples : 1. The Oscillator. 

The Hermitian form 

+ 00 

f * f>(x) <P(x) dx, 


representing the co-ordinate x of the oscillating particle has, 
as we have already found [(3.11)], the coefficients 


?««' = 0 if n' + n ± 1 ; 


2mco 


( 8 . 12 ^ 


with respect to Heisenberg’s co-ordinate system, in which the 
energy is referred to its principal axes. We thus obtain the 
selection rule the quantum number n can only change 

by ± 1, the oscillator then absorbing or emitting a photon of fre- 
quency v — <n and energy ha>, in accordance with (3.10). The 
selection rule makes it clear why no higher harmonics are ex- 
cited in the simple oscillator. We have also found that the 
matrix ||/> nn <||, which represents the linear momentum in Heisen- 
berg’s co-ordinate system, is given by (3.12) 


P, 


' B - 1- 


km cun 


i Pm 
for 


1 /A 

w- 


moj(n + 1) 
2 


1 


(8. IS) 


2 

Pnn' = 0 for n' 4= n ± 1 
2. Electron in spherically symmetric field. 

The result (4.7) for surface harmonics yields the selection rule 


l-+l± 1 (8.14) 

for the azimuthal quantum number l ; for 1 = 0 only the transition 
0 ~ ^ 1 is possible. On introducing the magnetic quantum number 
m as in § 4, the characteristic functions depend on the 

meridian angle tf> about the z-axis only in the multiplicative 
factor e im * ; here 

x ± iy = r sin 6 • e ±i * , z = r cos 9. 
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In order to obtain the dependence of the matrices q x -f- iq y> 
q x — iq y , q z on the transition m m! we must evaluate the 
integral 

2rr 

J e(a<l> ) e(— m</>) e(m'<f>) d(j > , 
o 

where a = 1, — 1, 0, respectively. The integral vanishes 
unless m f -f a = w. The only components of q x + iq y which do 
not vanish are those corresponding to the transitions m -> m — 1 
in which the magnetic quantum number decreases by 1 ; for 
q x — iq y , m -> m + 1 ; for q Z) m-*m. 

This last selection rule cannot be obtained from the spectra 
themselves as long as the terms corresponding to different 
values of m (\m\ ^>1) coincide. But these terms are broken 
up into their various components by a homogeneous magnetic 
field in the direction of the 0 -axis [Zeeman effect). On “ longi- 
tudinal ” observation of the light emitted in the 0 -direction we 
find instead of the one line [n, l ) ->* [n f , V) several left- and right- 
circularly polarized components, the former of which arise from 
the transitions m -> m — 1 and the latter from m -* m + L 
On “ transverse ” observation, e.g. along the y-axis, we find 
two transverse linearly polarized lines arising from m-> m ±1, 
and in addition a longitudinally (i.e. along the 0 -axis) polarized 
line corresponding to the transition ra -> m. (Polarization as 
here used means the direction of oscillation of the electric dipole, 
and therefore the direction of the electric field strength.) 

In the term spectrum of the alkali elements , which is, however, 
typical in this respect, even for the more complicated spectra 
of the other elements, we distinguish between several series by 
means of the letters s, p, d, f, g } ■ • \ Each series consists of 
infinitely many terms which we number in the direction of 
increasing frequency by the integer n. It is found convenient 
to let n run from 1 on in the ^-series, from 2 on in the ^-series, 
from 3 in the ^-series, etc. The values of the terms ns, np, 
nd, * • • are then given by the “ hydrogen-like ” formula 

_ R 

(n + k) v 

in which k = k s , k P) * * * is a correction term depending but 
slightly on n, the numerical value of which but rarely exceeds 
1/2 and is very close to 0 for high series (/, g, . . .). Only terms 
lying in neighbouring series combine to produce a line , i.e. an 
s-term combines only with a p- term, p only with s and d , d with 
p and /, etc. In particular, the transitions np -> Is give rise 
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to the principal series , which also appears in absorption, nd -* 2 p 
to the lines of the diffuse series , ns 2p to the sharp series , 
and nf-r 3<Z to the Bergmann series* 1 

The alkalies are univalent, i.e. in chemical reactions only 
electron, the valence electron, plays a r61e ; the others, 
together with the nucleus, constitute an inert closed shell It 
is therefore reasonable to assume that the optical spectra of 
the alkalies are caused by quantum jumps involving only this 
valence electron, while the core A + remains in its normal state. 
We have seen above that hydrogen in the normal state is re- 
presented by a spherically symmetric wave function ip ; we 
therefore assume, disregarding the reaction of the valence 
electron on the core, that this feature of the core being 14 closed ” 
is to be expressed by ascribing spherical symmetry to it.* We 
have then to deal with the problem of an electron in a spherically 
symmetric field, which we have already discussed above. In 
accordance with the empirical combination principle and the 
theoretical selection rule for the azimuthal quantum number Z, 
the s , p } d, /, * • • terms are to be taken as having l = 0, 1, 2, 3, 
• • • respectively, n then runs from l + 1 on in the series with 
azimuthal quantum number Z, as in hydrogen.** 


§ 9. Perturbation Theory 

The problem with which perturbation theory is concerned is 
the following : Let the energy H consist of two terms H=H+eW ;r , 
the second of which, the perturbation term eW , is small compared 
with the first ; this we express by the “ infinitesimal ” numerical 
constant s, of which powers higher than the first are to be 
neglected. Assume that the quantum problem for the “ un- 
perturbed system ” with energy H has already been solved, so 
that the Hermitian form H has already been brought into 
normal (diagonal) form, and let 91', 91", • • • be the character- 
istic spaces of H with characteristic numbers £', • • •. The 

problem is to find the solution of the equations for the “ per- 
turbed system ” with energy H. 

In order to illustrate the typical difference between degenerate 
and non-degenerate systems we first consider the system space as 
2- instead of oo -dimensional ; then 




+ eW. 


* Why He and not H is the first closed atom is only to be understood as 
the result of a profound modification of wave mechanics ; see Chap. IV. 

** Concerning the introduction of the u true quantum number ** for 
elements other than hydrogen, see Chap. IV, § io. 
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If E x =4= E t the unitary transformation which brings H into 
diagonal form differs from the identity only by terms of order s. 
Consequently the probabilities l^] 2 , \x 2 \ 2 that in the pure state J 
H has the values E x , E z will change only by amounts of the 
same relative order e ; they remain constant to the same ap- 
proximation with which eW may be neglected in comparison 
with H . But the situation is quite different for degenerate 
systems, for which E x = E % == E, for the principal axes of H 
are then indeterminate and this arbitrariness is expressed in 
the “ instability"’ of the system under the influence of a per- 
turbation. We set up that normal co-ordinate system e/, e/ 
in which W assumes the diagonal form ; the co-ordinate vectors 
are then also characteristic vectors of H, since E x = E 2 . But 
these vectors can obviously differ arbitrarily from the original 
co-ordinate vectors e 1} e 2 , whereas the energies hv x , hv 2 ' can only 
differ from E by a term of order e. On returning to the original 
co-ordinate system we have 

* «(— v i l ) + *12 ' «(— H '<), 

^21 * e ( ^1 1 ^22 * ^ ( ^2 J 

where flj — (* u , fl a ), a 2 = (a 12 , a 22 ) are two mutually per- 
pendicular vectors whose directions coincide with those of e/, e 2 '. 
The probabilities for the two states e x , e 2 vary periodically in 
time with the small beat frequency vf — v-l (resonance between 
states c 1} e 2 ). Quantum states with the same energy are therefore 
in resonance with one another. The magnitudes of the components 
of l in the characteristic spaces 9?', 31", • • •, i.e. the probabilities 
for the various numerically different values of H remain ap- 
proximately constant under a small perturbation, but this is 
not the case for the absolute values |#„| of the individual com- 
ponents x n resolved along the axes of an arbitrary Heisenberg 
co-ordinate system of the unperturbed system. 

In accordance with the foregoing we can formulate the 
perturbation problem in two forms : I. Determine the change, 
due to the perturbation, in those states in which the energy 
H of the unperturbed system is determinate. This formulation 
has a sound physical interpretation if we consider the perturba- 
tion as acting during a time interval t lt f 2 . We then find how the 
probabilities for the various quantum states change under the 
influence of the perturbation , i2 II. Determine the quantum 
states and energy levels of the perturbed system, i.e. the char- 
acteristic values and characteristic spaces of H. We ask in 
particular how the terms are broken up and displaced under the 
perturbation. We consider II first, 



88 QUANTUM THEORY 

We first decompose the Hermitian form W into two parts : 
W 0 + V . To the first belong those portions of W in which 
a characteristic space $R', SR", • • • of H intersects itself, and to 
V those in which two different characteristic spaces intersect. 
If the characteristic values of H have but finite multiplicity 
the problem of bringing W', that part of W in which SR' intersects 
itself, into diagonal form deals only with the space SR' of a finite 
number of dimensions. If SR' is not simply a one-dimensional 
space, the resonance phenomena mentioned above will appear. 
The co-ordinate system, consisting of characteristic vectors of 
H, is now more precisely specified, for now W 0 also appears as 
a diagonal matrix ; let E n be the characteristic values of the 
H + eW 0 = H 0 so obtained. The single term value E ' asso- 
ciated with SR' has in general been resolved into as many different 
characteristic values E n of H 0 as there are dimensions in the 
sub-space SR'. 

The remainder V = ||v mw || of the matrix is such that v mn = 0 
if the characteristic values E m , E n of H are equal. The in- 
finitesimal unitary rotation 

Sx = e-Cx, C — ||c mn ||, 

of order e transforms H into H + 8H where 

8H = e(H C - CH) ~ s{HC - CH). 

On choosing this transformation in such a way that 8H = — eV, 
H = H 0 + *V goes over into H 0 ; this can be accomplished by 
choosing c mn = 0 if E m = E n and 



otherwise. The characteristic values E n of H 0 are therefore the 
energy levels of the perturbed system of energy H if we neglect terms 
of order e a . 

W 0 can be considered as the time mean of the perturbation 
terms, averaged over the motion of the unperturbed system. 
For by (8.7) the mean value of the element a mn (t) of the matrix 
A(t), which represents an arbitrary physical quantity of the 
system, is a mn or 0, according as v m = v n or not. In statistics 
angular brackets are often used to denote the mean value of 
a quantity ; we may therefore write 

IF 0 =<PF>, H 0 = <H>. 

The solution of II naturally provides an answer to the 
question I. But it is more convenient to employ the method of 
variation of constants for the calculation of the effect of the 
perturbation over a limited time interval — the smaller the 
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constant e, the longer we may take this time interval to be. 
Assume that at time t = 0 the system is in the quantum state 
0 and that the perturbation begins to act at this time ; we ask 
for the probability that the system will be found in the state 
n at time t . That is, we seek that solution of the equations 

- Yif = *" + f £ W nm x m (» = 0, 1, 2, • ■ •) 

which reduces to 

x Q — 1, x l = x 2 — ■ • • = 0 
at time t = 0. Writing 

‘ e(— V n t) 

the equations for are 

— - £ = - T W £ e^n ~ "mV • 

■bn ? ” nm£m e j 

t « m 

for e = 0, £ n = 0. Neglecting terms of order e 2 , we can take 
the initial conditions 


£> = i » & = £2 = •••== 0 


as the 0 th approximation ; on substituting these values in the 
equation we obtain as the first approximation 


eW nl) e " i( \ - v nV — 1 

h ^0 — v n 


K =t= v o)- 


On setting v 0 — v„ = v, the desired probability is 

= 2[1 W1 |H..|-. (9-1) 


It is to be noted that in accordance with this result the probability 
of transition from state 0 to state n is determined by |H 0 «| 2 . In 
the case of resonance (y n = v 0 ) the transition probability in- 
creases at first with the square of t : 

w-GD’-W 1 ’- 


§ 10. The Problem of Several Bodies. Product Space 

A physical system consisting of two particles of masses m , m f , 
co-ordinates xyz; %' y' z’ and linear momenta |), has as 
its Hamiltonian function 

H “ i M + A + #3 + »(#■? + P? + #7) 

+ V(xyz; x' y' z'), (10.1) 
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where V is the potential energy. We assume, as in the older 
physics of central forces, that we are here dealing with an action 
at a distance so that the potential energy depends only on the 
simultaneous positions of the two particles. This assumption 
naturally breaks down when, in accordance with the theory of 
relativity, we take into account the finite velocity of propaga- 
tion of the disturbance, which requires the introduction of a 
field. The wave function ip of the system will depend on all 
six co-ordinates xyz ; %' y' z' in addition to t ; the operators 
corresponding to these functions in the domain of such functions 
tp are multiplication by x, • • • ; *',•••, and to the linear 


momenta correspond the derivatives j • 
From (10.1) we then obtain the wave equation 


h t) - 

t lx" ' * '■ 


h tip 

IF 


. b? t » . h l . 

+ 2 ^ A ^ + 2 ^ A ^~ 


V -ip = 0. 


( 10 . 2 ) 


We must ask for the probability that the one particle is to be found 
at a point P and } simultaneously , the other is to be found at a 
point P\ The probability density is accordingly to be computed 
for a 6-dimensional space with co-ordinates xyz ; x f y r z r . 
Indeed, the wave field is not to represent directly occurrences 
taking place in physical space, but is to determine the appear- 
ance at definite positions or with definite energies and momenta ; 
there is consequently nothing absurd in the fact that its medium 
is this abstract 6-dimensional configuration space. 

In order to be independent of the special procedure by which 
the scalar wave mechanics puts together two systems a, 6 to 
form a single system c, as suggested by this example involving 
the Hermitian forms representing the co-ordinates and momenta 
of the two systems, we must first discuss the multiplication of 
spaces from a purely mathematical standpoint. 

With each vector j — {%+) in a space 91 of m dimensions and 
each vector t) = (y k ) in a space © of n dimensions there is 
associated a vector J = £ X t) with components 

Zik = x { y k ( 10 . 3 ) 

in an m • n- dimensional space % = 9t X ©, the product space » 
The components are here numbered by means of the index 
pair ( ik ) = /. The totality of vectors } = j X t) do not them- 
selves constitute a linear manifold, but their linear combinations 
fill the entire product space %. With the linear correspondences 
A in 91 and B in © : 
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is associated a linear correspondence C — A X B in X : 

4 4 = 2X'» b k , k x t y k , 

i,k 

or 

4 = I c viZir cm = *,'<&*'* [Z = (*&)> Z' = (i'fe')]- 

I 

Naturally, to this multiplication corresponds the law of com- 
position 

(A x B)(A 1 X 5 X ) = (^ x X 5^), 

where X!, are correspondences of SR on itself and 5, B x are 
correspondences in @. A co-ordinate system in SR and one in 
© together determine a co-ordinate system in % ; if the co- 
ordinate system in SR is subjected to the transformation A and 
that in © to the transformation 5, then the co-ordinate system 
associated with them in % undergoes the transformation A X 5. 
In accordance with the equation 

d[x t y k ) = dXi • y k + x { • dy k , 

to the infinitesimal correspondence H in SK, J in <3 corresponds 
the infinitesimal correspondence 

(■ H X 1.) + (l r X J) (10.4) 

in where l r , denote the unit matrices in SR, ©, respectively. 
All of the foregoing is applicable to arbitrary vector spaces. 
When SR and © are both unitary spaces, then % is also, for by 
(10.3) 

• IVkJk 

is an invariant if 5^^, Hy k y k are ; A X 5 is unitary if A and 
5 are. 

Accordingly, two physical systems a and 6 are compounded 
to form a total system c as follows. The system space X of C 
is SR X @, where SR is the system space of ft and © of ft. Let 
the arbitrary physical quantity oc in SR be represented by 
the Hermitian form A ; on replacing all these forms A by 
A X 1*, where l s is the unit form in an arbitrary space @, there 
exist between these latter exactly the same relations as between 
the A y so that from the solution of a quantum problem in SR 
there arises a solution for the corresponding problem in SR X ©, 
but there exists no real distinction between the two. In the 
system c obtained by composition we have therefore to as- 
sociate the Hermitian form A X l s with a quantity oc of a and 
l r X 5 with j8 of ft, where A, B are the forms associated with 
a, in SR, ©, respectively. The totality of quantities of the 
composite system C is obtained by starting from the quantities 
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belonging to the component systems a and 6 and multiplying 
and adding them together in all possible ways. The quantities 
a of ft commute with the quantities of 6, for 

(A X l,)(lr X£) = ^XS = (1,X B)(A X l s ). 

We refer to the content of these last two sentences when we 
say that C consists of two kinematically independent parts ft and f>. 

The two systems are dynamically independent if the energy 
H of the composite system is the sum of the energies HW 
of the partial systems : 

H = (HO X 1) + (1 X H<*>). 

The infinitesimal unitary correspondence ^ • H in the total 

system space is then that one which is due to the infinitesimal 

unitary correspondences rr • Hf 1 ), ^ ■ H< 2 > in the two original 

system spaces [(10.4)]. If HO) and H* 2 ) are both in diagonal 
form, then H is also, and the characteristic numbers are given 

E l = E[ 1] + or „, = „<’> + „[?) [l = (ik)} 

If we have a pure state for the total system which is repre- 
sented by the vector c of absolute value 1 and components 
c ilC} and if Q — ||^tv|| is an arbitrary quantity in a , then the ex- 
pectation of Q in the pure state c is 

(Q} = 24 ii'$kk'Cik c i f k f ' 

This has the form (7.2) with 

A = ||a«>|| = \\Zc ikCi , k \\. 

Afe) is the Hermitian form 

zlzwiY 

k i 

in SR. But we see from this that we are not dealing with a pure 
state in a, for a iV will not in general have the form a^u Con- 
ditions which insure a maximum of homogeneity within t need 
not require a maximum in this respect within the partial system it. 
Furthermore : if the state of a and the state of ft are known , the 
state of c is in general not uniquely specified , for a positive definite 
Hermitian form || a ikl f /*/|| in the product space, which describes 
a statistical aggregate of states c, is not uniquely determined by 
the Hermitian forms 

m&ik, i'ki E a ik , ik f 
k i 
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to which it gives rise in the spaces SR, <5. In this significant 
sense quantum theory subscribes to the view that “ the whole 
is greater than the sum of its parts which has recently been 
raised to the status of a philosophical creed by the Vitalists 
and the Gestalt Psychologists. 

The kinematically independent parts into which a system 
can be resolved need not be spatially separated, nor need they 
even refer to different particles. We can, for example, resolve 
a single particle, whose physical quantities can all be expressed 
in terms of %, y, z ; p Xi p y , p Z) into three partial systems with 
fundamental quantities p x | y } p y j 0 , p z . For quantities 
which belong to different partial systems, for example a quantity 
which can be expressed in terms of p x alone and one which 
is in terms of y, p y alone, commute with each other in the sense 
of matrix multiplication. 

In the perturbation theory we are usually concerned with a 
system which consists of two kinematically independent parts 
and which are almost dynamically independent. Disregarding 
the interaction bW for the moment, let hv n and hp r be the energy 
levels of the two parts, so that h(v n -f- p r ) are the energy levels 
of the unperturbed total system. On writing in equation (9.1) 
$ = (n, r) in place of 0 and s' = (»', r f ) in place of n, whence 

v = O'* + pr) — K' + Pr) = Vnn' + Prr' \ 

Vjm' ^ V n JV, p rr t = p r p r 

we find as the probability that the total system goes over from 
the state $ to the state s' during time t : 


2 /e\ 2 1 — cos (y 4- Prr ,)t 
\h) (v nn , 4- p rr ,y 


W(nr , n'r') 2 . 


(10.5) 


The probability that the first system will be found in the state 
n' after time t } the total system having been in the state s = (nr) 
originally, is obtained from (10.5) by summation with respect 
to /. 


§ 11. Commutation Rules. Canonical Transformations 

The development of wave mechanics in §§ 1-3 went beyond 
the general scheme of §§ 7 and 8 in that it employed certain 
specific Hermitian forms to represent the co-ordinates and 
momenta of the particle. We are now interested in seeing how 
this can be formulated in an invariant manner, without recourse 
to any special co-ordinate system in system space. 

For the Hermitian forms q 1 p representing a rectangular 
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co-ordinate and its associated momentum we postulate the 

commutation rule 

ti- » -it (n.i) 

If the system has only one degree of freedom, these two quantities 
appear as canonical variables in classical mechanics. All physical 
quantities of the system are then functions of p and q ; in order 
to avoid complications we restrict ourselves to polynomials f in 
p and q y and assume, in particular, that the Hamiltonian function 
H has this form. What are we to understand by the derivatives 
f p and f t of / with respect to p and q in this domain in which 
p and q are not commutative in multiplication ? We should 
in any case require that differentiation with respect to q should 
obey the following postulates : 

(1) p q = 0, q Q = 1 ; 

(2) if + g)n = fn + Sa and ( a /)« = a " /a, where a 1S a number ; 

(3) (/*).=/.•*+/•&. 

We see immediately that these conditions uniquely determine 
the derivative of a polynomial /, unless they happen to lead to 
contradictions. But that they do not lead to contradictions 
can be seen from the fact that they are obeyed by the definition 

ih'f<=fP ~ Pf- (11-2) 

(1) follows immediately from the commutation rule (11.1), and 
the linearity (2) of the process is evident. (3) is proved by the 
formula 

(fg)P - Pifg) = figp - pg) + Up — Pf)g 

which involves only the distributive and associative character 
of matrix multiplication. Similarly we can show that 

— ik • f p = fq — qf. (11.2) 

The fundamental dynamical law gives us the equation (8.6) : 


for any Hermitian form /. On applying this equation to p and q 
— which obviously suffices to establish the corresponding result 
for any polynomial / of p and q — and comparing it with the 
formulae (11.2) applied to the particular function H, we are led 
to the familiar Hamiltonian equations of classical mechanics : 



pf 



(11.3) 
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It is a universal trait of quantum theory to retain all the relations 
of classical physics ; but whereas the latter interpreted these re- 
lations as conditions to which the values of physical quantities were 
subject in all individual cases } the former interprets them as con- 
ditions on the quantities themselves , or rather on the Hermitian 
matrices which represent them. This is the more significant 
formulation which the new quantum theory has given Bohr's 
correspondence principle. 

The commutation rule (11.1) is of a rather remarkable 
nature. It is entirely impossible for matrices in a space of a 
finite number of dimensions, and it alone precludes the possi- 
bility that in an oo -dimensional space q (or p) have only a discrete 
spectrum of characteristic numbers. For on referring q to its 
principal axes 

q = ||?mn||) #nn = Q mn =0 (fft =)= n ) j P = ||^*nn||> 

the left side of the commutation rule has the components 
pmd&n — ?m) i hence the main diagonal consists of nothing 
but zeros ! The question arises as to whether it can be con- 
cluded from (11.1) alone that the forms representing q and p 
can always be given the form 

+ 00 +00 

J* ${x) >/j(x) dx, J fix) • J^x dx 

“00 “00 

for an arbitrary vector p with components ip(x) on employing 
an appropriate co-ordinate system in system space. We shall 
see in Chap. IV, § 15, that, on introducing a certain irreducibility 
condition, this is in fact the case. 

On taking into account the three space co-ordinates q a and 
their associated linear momenta p x (a = 1, 2, 3), we have in 
place of the one commutation rule (11.1) the following : 23 


P*Pp ~~ PfiP* = °i 


q fi q a = 0 for all a, ]8 ; 

a (« = /?) 

~~ \0 (a*/?)- 


The same commutation rules apply to the case in which we have 
several particles, the only difference being that then oc runs 
through 6, 9, • * • values, according to the number of particles, 
instead of 3. These commutation rules are the necessary and 
sufficient condition that the dynamical law, which governs the 
time rate of change of the state vector % in system space, leads 
to the Hamiltonian equations for the “ canonical variables ” 
q X} p a representing the co-ordinates and associated momenta of 
the various particles composing the physical system — whatever 
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the dependence of the Hamiltonian function H on these quantities 
may be. 

In classical mechanics the Hamiltonian equations are invariant 
with respect to canonical transformations , 24 In a system of 
/ degrees of freedom the transition from a set of variables q K} p„ 9 
describing the state to a set q a) p' a (a = 1 , 2, • • •, f) is a 
canonical transformation if the difference 

£p'M — EPM* (11.5) 

is a total differential. If, for example, the q a are subjected to 
a transformation 

••■?/) 

among themselves, the p a must transform as the components 
of a “ covariant vector” in #-space in order that the whole be 
a canonical transformation (“ extended point transformation ”) : 



Perhaps the simplest canonical transformation is that in which 
the roles of q and p are interchanged : 

= — <1*, = P** 

The canonical transformations constitute a group [cf. Ill, § 1], For 
the identity, i.e. the transition from (p , q) to ( p , q) f is a canonical 
transformation ; the inverse ( p\ q') -> (p , q) of a canonical 
transformation (p } q) (p' t q') is also canonical ; and from the 
canonical transformations (p 9 q) -> (p\ q') y (p' y q') -> (p" } q ") 
it follows that the resultant transformation {p , q) -v (p n y q") 
is also canonical, for if 

zM*-2:pjq n zpw; - sp'm 

are total differentials their sum 

ZfX - ZpAa 

is also. 

An infinitesimal canonical transformation is one in which 
p\ q* differ infinitely little from p } q . We can consider it as 
an infinitesimal deformation of the 2/-dimensional (/>, #)-space 
which takes place in th6 infinitesimal time interval e = St. We 
introduce the components S p } 8 q of the displacement vector by 
means of the equations 

K - Pa = 6 • 8 Pa, 9l ~ 9« = 6 • %. * 
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Since (11.5) must be a total differential, 

ZP'« + Zq x dp „ = dT (11.6) 

must also ; in our case T must differ only infinitesimally from 
2p«q«- We may therefore write 

T =Zp«q:-eS; 

<X 

considering 5 as a function of p a and q' we have, in accordance 
with (11.6), 




or 


*s 


q “- q ° 

1 V* 

iS _ 

as 

II 

1 ^ 

1 

II 

l>Px 


(11.7) 

Since we may legitimately neglect terms of order £ 2 , we may 
identify q K with q a on the right-hand side of these equations. 
We call S the generating function of the infinitesimal canonical 
transformation . 

In accordance with the Hamiltonian equations, the state 
of a system, represented by a point (p, q ) in (p ) #)-space, goes 
over into a state (p dp, q + dq) during time dt. If we follow 
this transition for all possible initial states (p, q) we obtain an 
infinitesimal deformation of the space whose points represent 
the state of the system. The Hamiltonian equations assert that 
this deformation is an infinitesimal canonical transformation with 
generating function H * dt . It follows from this without any 
calculation that these equations have a significance which is 
independent of any particular choice of canonical variables. 

Now in quantum theory the Hamiltonian equations (11.3) 
assert that the state vector $ in system space undergoes the 
infinitesimal unitary rotation 

H = - ■ H h (8.1) 

so the infinitesimal canonical transformation of the quantities 
p, q is here obtained by subjecting the argument J in the Her- 
mitian forms representing them to the infinitesimal rotation 

e.8|=-|-5 X . 

We find that the increments of the quantities p x , q a are in fact 
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and, in virtue of the commutation relations (11.4), this agrees 
exactly with (11.7). On generating a finite canonical trans- 
formation by the successive application of an infinity of in- 
finitesimal ones we arrive at the result that the unitary corre- 
spondences of system space on itself in quantum theory : 

?'= ui 

correspond to the canonical transformations of classical mechanics ; 
more precisely, only those for which the matrix U is expressible 
in terms of the matrices p , q , but we may for the present pass 
over the question as to whether every matrix U can be obtained, 
or at least arbitrarily closely approximated, in this way. Since 
the commutation rules (11.4) remain unchanged under rotations 
of the normal co-ordinate system, they are valid for an arbitrary 
set of canonical variables. This is also evident from the fact 
that they are the conditions that the dynamical law (8.1) lead 
to the Hamiltonian equations 

dq* __ dp* __ __ m o\ 

dt ipf dt c )q« ' 

The general procedure for the quantum mechanical treat- 
ment of a physical system suffers from the disagreeable fact 
that the expression for the energy in terms of the canonical 
variables must be taken from the classical model, and in ad- 
dition the transition to quantum mechanics is even then not 
unique, for the model offers no means of telling whether a 
monomial such as p 2 q is to be interpreted as p\ pqp , qp 2 or 
a linear combination of all three [cf. IV, § 14]. The provisional 
character of such a procedure is clear, but the results so far ob- 
tained seem to justify the hope that the path we have entered 
upon will lead to a unique formulation of the laws governing 
the actual physical phenomena. We need then concern our- 
selves longer with the general mechanical scheme. 

§ 12. Motion of a Particle in an Electro-magnetic 
Field. Zeeman Effect and Stark Effect 

Let the spatial co-ordinates xyz now be denoted by x x x z x^ 
and the time t by x 0 . If <f> is the scalar and c 21 the vector potential 
of the electro-magnetic field, then in the theory of relativity 

( (f> } 2I X , 21^, 21 z ) = (<£o> &Z) fit) 

are the components of a vector in the space dual to the 4-di- 
mensional world. Let 

F — . 

* lx. ’ 
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F 10 , F i0 , F 30 are the components of the electric field strength 
c(F 33 , F 31 , F 13 ) the components of the magnetic field strength 
Denoting the components of the velocity of a particle by 
v lt v 2 > v 3 , its proper time is 

ds — V dt 2 — (dx\ -(- dx l + dxfj /c 2 

=- dtV 1 — v 2 /c 2 (v 2 = v\ + v\ + vf ) . 

dx 

With the world vector u« = -j- - is associated the dual u * with 
components as 

u r = u r (r = 1 , 2, 3), u 0 = ~ c 2 u° . 


The invariant equations of motion for a particle of mass m and 
charge — e are 


ds 


e 2 


or 


d(mu t ) 

dt 


*(Fi 0 + E F ik v k ) 


(i = 1, 2, 3). 


( 12 . 1 ) 


The right-hand side is in fact the ponderomotive force 

-«(® + £[*$])■ 


These equations arise from the Hamiltonian function 


H 


Ho + C 


mV + E (Pi + e^) 2 , 

i = 1 


in which x 1 x 3 x 3 ; pip 3 p 3 are the canonical variables, 
the Hamiltonian equations 


yield 


s dxt _ = c(j>,- + e<k) 

‘ rff ap f v 


Pi + Hi = ; 


in the remaining equations 

dpi = _ ^ = _ | , ^ Hk Pk_+ H k 

dt ’ a^ \a»i *-i a* f 1/ 


( 12 . 2 ) 

In fact, 


the left-hand side is 

_ d(mUi) 

~ dt 


e {Hi _L y- Hi . 
\a* 0 a** 



But this is the desired equation (12.1) : 

dimut ) = _ f (Ho _ Hi) j_ y (Hie _ Hi) X 
dt Ha*, a x 0 J t=i \ix i a xj */' 
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The negative energy — H is the time component p 0 of the dual . 
vector whose space components are the components of linear 
momentum p = (p lt p 2) p B ), so the equation (12.2) can be written 
in the rational form 

4(£o + e<f> 0 ) 2 — Z [pi + e<f>i) 2 — m 2 c\ 

C i =» 1 

From this we obtain the simple rule : The influence of an electro- 
magnetic field on a particle of charge — e can be expressed by re- 
placing pc by pc + ecf)# in the equations of motion for a free particle . 
On going over to quantum theory p a becomes the operator 

i and is contragradient to the 4-dimensional displacement 

t oXc 

dXc , as is seen from the equation 

#= E ^r dx «- 

c OX x 

Our rule is now : On introducing a field of potential <f>„ 

^ c) 1 >€ 

- — must be replaced by |- r 6 x (12.3) 

OXc ox a h 

in the wave equation of the particle. Only f ifi has a simple physical 
significance ; it is therefore to be assumed that the laws which 
govern ift remain invariant on replacing <jj by e ix • \fs, where A is 
any real function of position in space-time. On the other hand, 
in the classical theory of the electro-magnetic field only the 
field strengths, and not the potentials, have an objective signifi- 
cance, i.e. the laws are invariant on replacing <f> a by <f> x — — , 

b% a 

where ft is also an arbitrary function of the x a . On examining 
our wave equation for these invariantive properties we find 
that it is not invariant under each of them separately, but that 
there must exist a certain relation between A and ju. The field 
equations for the potentials \fi and <f> of the material and electro- 
magnetic waves are invariant under the simultaneous replacement 

i/j by e iX -ft and <j> a by <f> a — - ^ • 

here A is an arbitrary function of the space-time co-ordinates. 
This “ principle of gauge invariance ” is quite analogous to that 
previously set up by the author, on speculative grounds, in 
order to arrive at a unified theory of gravitation and electricity. 88 
But I now believe that this gauge invariance does not tie to- 
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gather electricity and gravitation, but rather electricity and 
matter in the manner described above. We shall discuss this 
principle more thoroughly in Chap. IV ; its significance and 
its interpretation will then be more apparent. 

On passing to the limit c —> co in (12.2), after separating 
out the factor me 2 , we return to ordinary mechanics : 

H = ecj) 0 + cj^Zipi + e<f>i ) 2 . 

On neglecting terms which are quadratic in the <j> i} we find, in 
addition to the kinetic energy ZpijZm, the potential 

i 

V = -4 + ^m- (12.4) 

We have already made use of the first part, that due to the 
electric field, in § 5. If we have, in addition to the field originat- 
ing in the nucleus, a homogeneous electro-static field in the 
direction of the #-axis and of strength F, for which <j> = — F • z, 
it adds the perturbation term 

W = eF * z 


to the energy. A homogeneous static magnetic field ip is 

obtained from the vector potential c% = ~[ipt], r = (x, y , z) ; 

Hi 

this adds to the energy the perturbation term 


2 me 




2 me 




i.e. 


W : 


eh 

2mc 


:.m 


(12.5) 


Zeeman Effect . — If the homogeneous magnetic field strength, 
of magnitude |§|, is in the direction of the £-axis, the per- 
turbation term is 

W = ho ■ L z , 0 = e M. (12.6) 

On choosing the characteristic functions as our co-ordinate 
system in the system space of the functions W, as well as 
the energy of the unperturbed atom, is in diagonal form ; in 
the state defined by nl, m it has the value 

ho • m. 


(12.7) 
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The components (nl, m) -+ [nl\ m% consistent with the selection 

rule for m, into which the line with frequency v ^ (E nl E n ' i') 

is broken up give rise to but three lines: one corresponding to 
all the transitions m^m, which is linearly polarized in the 
direction of the axis and is undisplaced ; one which is circularly 
polarized perpendicular to the s- axis, the frequency v of which 
is displaced by + o 1) ; and one which is circularly 

polarized in the opposite sense, with frequency y — o instead 
of v (m -* m -j- 1 ). This normal Zeeman effect is found only 
in the so-called singlet lines. 

Stark Effect.— In accordance with the general perturbation 
theory, the displacement and resolution of terms in the presence 
of a homogeneous electric field is determined, to terms of first 
order, by the matrix 

eF • <s>. 

In consequence of the selection rule l -*■ / i 1 > (.%) — 0 , unless 
accidentally all energy levels whose azimuthal quantum numbers 
differ by 1 coincide. Ignoring this exceptional case, we should 
expect to find no 1 “ order perturbation effect increasing linearly 
with the field strength F ( linear Stark effect ), but only a quadratic 
effect, which is much smaller. This is in agreement with the 
experimental data on alkali atoms. Hydrogen is, however, 
degenerate, since for it energy levels with the same principal 
quantum number « and l — 0, 1, • • n — 1 coincide. The 
calculations for this case have been carried out by Schrbdinger 
and compared with experiment. 26 

§ 13. Atom in Interaction with Radiation 

Following Jeans, black body radiation is mathematically 
equivalent to a system of infinitely many oscillators. Maxwell’s 
equations for the free ether are 

div Sq — 0 , curl (5 + ~ ^ = 0 1 

div (5 = 0, curl £> — -^-7 = 0. 

’ v c It 

In order to simplify the relations, we assume that the walls o£ 
the radiation cavity of volume V are reflecting; then (S is 
perpendicular to the walls at the boundaries of the cavity. 
Since the black body is at rest it is of no particular advantage 
to carry through the calculation in a relativistically invariant 
manner ; we may therefore normalize the vector potential 
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in such a way that the scalar potential vanishes. We then 
have 6 = — — and the equations in the first row are satisfied 
by § = c * curl 21 ; the equations in the second row become 

div« = 0, A*-^«=0. 

On the boundary 91 is normal to the walls. Let the characteristic 
numbers and characteristic functions of the equations 

AH + £h = 0, div 2t = 0, 

with the boundary condition that 51 is there normal, be denoted 
by 

v ~ p x 0), 9t* [a = 1, 2, 3, • • •], 
normalized in accordance with 


On setting 


J(WiF - 


21 = 


where the coefficients q* depend on time but not on position, we 
find for them the equations 


dt 2 


dq a 


Introducing ~ = p a in addition to the q* ) this equation is 
that for an oscillator with Hamiltonian function 


we readily find on applying 

® — — £ p*% l«, Ip = c £ q“ ■ curl 9l a 

rt * 

that the energy of the radiation field is in fact given by 

V 

with this we have proved the theorem due to Jeans. For high 
frequencies p there are approximately 

Vp 2 dp 

TrV 


(13.1) 
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modes of oscillation in the frequency interval p, p + ^p- 27 We 
are interested above all in the limiting case of an infinitely large 
cavity ; the spectrum then becomes continuous and our formula 
for the density of frequencies becomes exact. 

On quantizing this mechanical system of infinitely many 
oscillators 28 in accordance with the theory of the oscillator (§ 3) 
and the process of composition (§ 10 — but cf. remark on p. 109), 
we find as possible quantum states s, each of which is characterized 
by the fact that in it there is associated with each index a an 
integer n a ^ 0, In this quantum state 

= h p a (n n "f" 2) 7 

or, on choosing the additive constant in the energy in such a 
way that the lowest energy value which the black body radiation 
is capable of assuming is 0, 

• hp a , H = iX ‘ h p*. 

a 

In the language of photons this means that when the cavity 
is in the state 5 it contains n„ photons of each kind a. The 
matrix element 

??»' = n i j ^2 , • "»**«i***J s = n \ I n z > ’ '» I ’ ‘ '] 

vanishes unless all the equations 

n[ = n 1} n' z = n 2 , n z — n Z) - • • 
hold with the exception of n' a = n„, which is to be replaced by 
«» = «»+ 1 or ri x —n a — 1. 

In the first case we have, by eq. (8.12), 

<f„. = (Emission), (13.2) 

and in the second 

<f„, = (Absorption). (13*2) 

y 2p* 

The first transition ^ s' consists in a photon of kind a springing 
into being, the second in the disappearance of one such photon. 
It follows from the above that in a transition for which q* t , 4= 0 
all other must vanish. 

Let an atom tvith fixed nucleus and electric dipole moment q 
interact with the radiation field . Differentiate the quantum 
states of the atom from one another by means of the index n 
and denote the corresponding energies by hv n ; then q = ||q nn /||. 
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A quantum state of the total system consisting of both atom 
and radiation is characterized by the quantum numbers 

The effect of the radiation on the atom is, in accordance with 
eq. (12.4) of the preceding paragraph, given to a first approxima- 
tion by the perturbation term 

eW = (q2l). 

It can be shown that the addition of such a term to the 
Hamiltonian function of the total system will, according to 
classical theory, not only indicate an influence exerted on the 
atom by the radiation field, but will also modify the equations 
of Maxwell in a way which indicates that the motion of the 
electrons in the atom affects the radiation field. The per- 
turbation term will accordingly call forth emission as well as 
absorption. To a sufficient approximation we may take for 91 
its value at the point occupied by the nucleus , provided we restrict 
ourselves to radiation whose wave-length is large compared with 
the dimensions of the atom. We now have 

eW = Z(Wq«. (13.3) 

» 

From this it follows than an element e • W nSt can only differ 
from 0 if s and s' are such that all rip = np with the exception 
of a single one n^, which must equal n x ± I. Then only the 
a th term contributes to the sum (13.3), and we have 

(<W'2U (13-4) 

Bohr’s frequency condition, which asserts that the emission or 
absorption of a photon in state a with energy hp * is associated 
with a quantum jump of the atom in which an amount 
± ~ v n /) = hp a of energy is lost or won, need by no means 

be satisfied here. The finite cavity has its own frequencies p xy 
and may therefore be in no position to take up the frequencies 
associated with the quantum jumps of the atom. This is true 
in principle, but as a matter of fact, as we shall see, Bohr's 
frequency condition is fulfilled to a very close approximation in 
the overwhelming majority of all transitions ; and this is more 
and more the case the larger the cavity is. 

Let the atom be in the state n and the radiation in the 
state 5 = {n x }. We set 

Zhn xPx =V-U(p)d P , (13.5) 

where the sum on the left is to be extended over those indices a 
for which p x lies between p and p + dp ; hence U(p)dp is the 
energy density of the radiation contained in the frequency 
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range p p + ip. In accordance with (10.5), the probability 
that the atom will find itself in the state ri after time t is given 

2 1 - cos (v , + p,J t . | n/>i |, (13.6) 

h 2 H K«' + M 2 

The contribution to this sum due to the cases in which a photon 
is emitted is, in accordance with equations (13.2), (13.4), given 
by 

2_ 1 — c os (»„ „> — p«)t _ A(n„ + 1) , !fi | a 

Z.2 & “ 


2p* 


(13*6,) 


(13.6.) 


A 2 -p«) 2 

and that for absorption by 

2 1 — cos ( v "n' + />«)* . Aw* I /i or >1* 

t-«2< 7 _i_ “ \a n~ W«n'*W • 

A 2 » (»W + P«) 2p a 

Consider first the case in which the term level v n > is higher than 

j# • Vfln , = v n — iv = — v is then negative. We now collect 
together all those terms a in the sum (13-6») for which p„ lies 
between p and p -l- dp. Since the position of the atom is not 
exactly fixed — even in consequence of the variations caused by 
the emission of photons — we may, for small wave-lengths, 
replace 2l 2 by its mean value in/V as given by the normalizing 

equation |2l \dV = 4tt, and we may also assume that all 

directions are equally probable for 91 a . The square |(2t*q)| 2 of 
the scalar product of 21 with a fixed vector q has then the mean 

(13.6 a ) then becomes 


value fp 


1 — cos (p 


v)t 4tt |q nn > 
3 V 


* Z h n “ p * 

2p 2 • 


(p - v)' 

On introducing (13.5) the sum (13.6 a ) may, to a good approxima- 
tion, be replaced by the integral 

4 7T lq nn 'l a ( 1 — cos (p — v)i _ Ujpjd p 
3 


A 2 


(p - ,) 2 


Essentially the only elements which contribute to the value of 
this integral, for a time f large in comparison with the duration l/v 
of an oscillation, are those for which p lies near to v. On developing 

P 2 ~ v 2 i_ 

in powers of p — v, the first term in the expansion contributes 
+ 00 

JJ[ v) , f 1 — COS X 


t 


dx 




(13.7) 
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to the integral ; all others are to be neglected. Similarly the 
entire amount (13.6 e ) due to emission is negligible, for its de- 
nominator (p 4" y) 2 vanishes nowhere. This means that the 
transition is almost invariably associated with the absorption of 
a photon whose frequency lies very close to v. The probability 
that the atom will appear in the higher state n' after lapse of 
time t increases in proportion with t ; the factor 


47 T 2 U(v) | . 1 2 477 2 

T (h/jj*' |c,wn 'i W 


u(v) 


is the probability that the transition n -> n f take place in unit time . 

This formula was obtained for the case in which the state 
n! possessed a higher energy level than n. In the reverse case 
only the sum (13. 6 e ) due to emissions contributes an appreciable 
amount. We now put v nn > = v n — v n > = v and obtain the same 
formula with this difference : in place 'of n a we now have n * + 1, 
or in place of the sum (13.5) the sum 


JtJhin# 4 1 )p<x — Uhnapx 4~ 2Jhpa,. 


The first is V • U(p)dp, and we denote the second by V * u(p)dp. 
This latter is equal to (hp) times the number of modes of vibra- 
tion of the cavity within the frequency interval p y p dp ; hence 
by (13.1) 


V * u(p)dp = V 


hp 3 dp 


w(v) = 


hv 3 


The probability that the atom drop from state n into the lower state 
n' in unit time is given by 


~^[U(v) + u(y))\q nn ]\ 


The additional term u(v) is characteristic for spontaneous 
emission . When the radiation is not enclosed in a black 
body, i.e. when there is no radiation density U(y) } the proba - 
bility that the atom drop from the state n to the lower state n 
in unit time , emitting thereby a photon whose frequency lies 
in the immediate neighbourhood of v = v n — v n >, is 

mrM*- 

This agrees with the formula obtained by integrating (8.11) 
over all directions. The probability that the atom jump from 
the level n into a higher level n' (i v n > > v n ) under these same 
conditions is zero. 
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In the energy field of the black body radiation we find not 
only absorption, but also “ stimulated emission both of which 
are proportional to the energy density U(v). On setting 


(13.8) 


the probability for a jump from state n to a higher state n' in 
unit time is 



Wn 


U(y) I (v = v n , — y„), 


(13.9) 


and the probability for the inverse jump, the drop from n' 
to n, is 


= A n , n [U(y) + u(v)] [ . 
Since ||q„ B ,|| is an Hermitian matrix, 



(13.9) 

(13.10) 


If there are a number of atoms in the radiation field and the 
whole system is in a steady state, then on the average as many 
atoms must make the jump « -> n' in unit time as make the 
inverse jump n' -> n. On denoting the number of atoms in 
the state n by N n , these considerations are expressed in the 
condition 


or 


An' ‘ N n U(v ) = A'n * A'[U(v) + u(v)] 
A _ , , _w(v) 

A + u( v y 


(13.11) 


The probability coefficients A nn , = A n , n have entirely dis- 
appeared— or rather, almost entirely, for the equation is valid 
only under the assumption that A nn , =f= 0 or q nn , 4 = 0, i.e. 
the transition is not to be forbidden by the selection 

rules. But for such a system in thermal equilibrium N„ must 
as shown by Boltzmann, be proportional to ” 5 

where 9 is the temperature and k the Boltzmann constant. 
Equation (13.11) then becomes 


e Mv n ' - __ j _|_ 

or the Planck radiation formula : 


u ( v ) 

U(v) 


U{v) = 


ghvlkd | 7 
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this formula is valid for all frequencies v whose energies can be 
exchanged by the absorbing and emitting atoms in accordance 
with Bohr’s frequency condition. 29 

We have thus finally returned to the historical origin of the 
quantum theory. We must now add three remarks concerning 
this treatment, due to Dirac, of energy exchange between matter 
and radiation. In the first place, it is able to explain the fact 
that the spectral lines are not sharp , but possess a natural breadth* 0 
Secondly, we must inquire what causes this difference between 
absorption and emission, processes which are transformed into 
each other on changing the direction of time. Indeed, the 
fundamental mechanical and field laws are invariant under the 
transformation $-> — $! The answer is that this difference is 
due to the preferential direction in time involved in the application 
of the theory of probability ; we assume a fixed initial state and 
calculate, with the aid of transition probabilities, the distribu- 
tion over the various states at a later time, not the distribution 
which would result from the equations for an earlier time. If 
no assumption is made concerning this preferential direction, 
t should be replaced by \t\ in (13*7). And finally, the fact that 
we have here treated Maxwell’s equations as classical equations 
of motion, and as such have subjected them to the process of 
quantization, may give rise to serious doubts — for in our general 
formulation Maxwell’s equations are already the quantum- 
theoretic wave equations for the photon ! But we shall see 
in Chap. IV, § 11, that this method is in fact the correct one 
to employ in order to go from one corpuscle to an indefinite 
number of corpuscles. For since the number of photons must 
remain indefinite — as a photon can, in contrast to an electron, 
spring into being or disappear — the method of composition 
described in § 10 is not applicable to them. 



CHAPTER HI 


GROUPS AND THEIR REPRESENTATIONS 

§ 1. Transformation Groups 

T HE concept of a group , one of the oldest and most 
profound of mathematical concepts, was obtained by 
abstraction from that of a group of transformations} 

A point-field, a domain of elements which we call points, 
on which the transformations operate, underlies the trans- 
formations. This point-field may be either the totality of a 
finite number of individually exhibited elements or an infinite 
set, in particular a continuum such as space or time. A 
mapping or correspondence S of the point-field on itself is 
determined by a law which associates with each point p of the 
field a point p' as image : p p f == Sp ; two correspondences 
Sp and Tp are identical if for all points p the two image 
points Sp and Tp coincide. If the point-field contains a finite 
number of elements the correspondence 5 can be defined by 
giving explicitly the image point for each point p ; for infinite 
sets, however, the association is only possible by giving the 
law of the function S. 

Among such correspondences there is a particular one which 
associates with each point p the point p itself : p -*» p ; it is 
called the identity I. Two correspondences can be applied 
successively : if the first sends the arbitrary point p into p ' = Sp } 
the second p r into p n = Tp' } then the correspondence resulting 
from the composition of the two is defined by the association 
P P" = T {Sp) an d is denoted by TS (read from right to left I). 
The resultant correspondence depends on the order of the two 
factors 5 and T. In order that composition be possible it is 
essential that the correspondences are ones which map the 
point-field on itself, and not on another point-field. 

We shall restrict ourselves to one-to-one correspondences S: 
the . image points p f = Sp associated with p shall always be 
distinct, and each given point p ' shall appear as the image of 
one (and only one) of the points p . Consequently such a one-to- 

110 
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one correspondence S : p -> p' determines a second, the inverse 
S ~ 1 : p f p of S, which just cancels it : 

S'(Sp)=p, S(S'p')=p' or 

S^S = I, SS - 1 = I. 

The inverse of S~ x is again 5 and the identity I is its own inverse. 
The resultant TS of two one-to-one correspondences S, T is 
itself one-to-one, and its inverse is (TS)” 1 = S^T” 1 — for 
on inverting the correspondences p -> p' p" there results 
P" P' P‘ Henceforth we shall consider only those corre- 
spondences, also called transformations or substitutions , which 
are one-to-one. In this domain we have, in accordance with 
what has been said, the two fundamental operations of inversion 
and composition. 

Examples . — 1. Let the point-field consist of n elements 
exhibited individually ; bring them into a particular order by 
numbering them with the integers 

1, 2, • • n. (1.1) 

This numbering consists in a one-to-one reciprocal relation 
between the elements of the point-field and the integers or 
possible “ positions ” q in the series (1.1). A permutation con- 
sists in the transition from one such arrangement to another. 
If we wish to operate in space we may think of the positions as 
fixed compartments into which the movable elements can be 
laid, or, conversely, we may think of the elements as fixed and 
shift the movable numbers about. With each permutation is 
associated a one-to-one correspondence p -> p' which tells 
which element p' occupies, after the exchange, the position 
previously held by p. Insofar as the method of numbering is 
considered as left to convention, the permutation is nothing 
more than this one-to-one correspondence. The concept is to 
be understood in this way when we are concerned with the 
composition or successive application of permutations. 

2. A kinematical example of a group is offered by the motions 
of a space-filling substance, in particular those of a rigid body. 
The positions or numbers of the preceding example are here 
represented by the material points and the point-field is the 
space itself. The one-to-one correspondence p -> p' connects 
the initial with the final state : that material point which origin- 
ally covered the spatial point p is taken to the point p' by the 
motion. Congruent correspondences of space on to itself will 
also be briefly referred to as “ motions ” in the geometrical 
sense. 

The concept of a group of transformations is now readily 
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formulated. We understand by it any system ® of transforma- 
tions of a given point-field, which is closed in the sense of the 
following conditions : 

1. It contains the identity ; 

2. If S belongs to ©, then its inverse S*" 1 does also ; 

3. The resultant TS of any two transformations S, T of ® 
is also a transformation of ©. 

As examples we name the group of all n ! permutations of n 
things, the congruent mappings or “ motions ” of 3-dimensional 
Euclidean space, all homogeneous linear transformations in 
n variables with non-vanishing determinants (affine correspond- 
ence of an n-dimensional vector space) and the group of unitary 
transformations in n dimensions. 

If the point p goes over into p' by means of a transformation 
of the group ©, then p' is said to be equivalent to p (with respect 
to the group ®). The same concept is applied when we are 
considering instead of a point p a figure consisting of points. 
Expressed in these terms, the three requirements for a group 
are nothing other than the three axioms of equality : 

1. p is equivalent to p ; 

2. If p' is equivalent to p, then p is equivalent to p' ; 

3. If p 9 is equivalent to p and p n to p\ then p" is equivalent 
to p. 

According to Klein's Erlanger Program 2 any geometry of 
a point-field is based on a particular transformation group © 
of the field ; figures which are equivalent with respect to ©, 
and which can therefore be carried into one another by a trans- 
formation of @, are to be considered as the same. In Euclidean 
geometry this role is played by the group of congruency trans- 
formations, consisting of the motions referred to above, and 
in affine geometry by the group of affine transformations, etc. 
The group expresses the specific isotropy or homogeneity of the 
space ; it consists of all one-to-one “ isomorphic correspondences ” 
of the space on itself, i.e. those transformations which leave 
undisturbed all objective relations between points of the space 
which can be expressed geometrically. The symmetry of a 
particular figure in such a space is described by a sub-group of 
© consisting of all transformations of @ which carry the figure 
over into itself. The art of ornamental tiling, which was per- 
fected by the Egyptians, contains implicitly considerable know- 
ledge of a group-theoretic nature ; we here find, perhaps, the 
oldest fragment of mathematics in human culture. But only 
recently have we been able to formulate clearly the formal 
principles of this art ; attempts in this direction were already 
made by Leonardo da Vinci ) who sought to give a general and 
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systematic account of the various types of symmetry possible 
in a building. But the most wonderful symmetrical structures 
are exhibited in crystals, the symmetry of which is described 
by those congruency transformations of Euclidean space which 
bring the atomic lattices of the crystal into coincidence with 
themselves. The most important application of group theory 
to natural science heretofore has been in this field. 

The following considerations fit naturally into the present 
discussion. Let the point-field M on which the transformations 
S of the group © operate.be mapped on the point-field N by 
means of the one-to-one correspondence A:p-*q; the case 
in which the correspondence serves to introduce new numbering 
or new co-ordinates is of particular importance. Through this 
correspondence A of M on N the transformation S of M becomes 
a transformation T of N ; in the particular case mentioned above 
T is simply a description of the transformation 5 in the new 
co-ordinates. It is evident that to the composition of trans- 
formations 5 corresponds the composition of the corresponding 
transformations T of N and that a group © of transformations S 
goes over into a group § of transformations T. The relation 
between these two transformations is 

T = ASA" 1 , (1.2) 

for if we denote the transformation S by p -> p' and if q y q ' are 
the points of N associated with p , p' by A, then the transforma- 
tion q q' of N is effected by 

q-+ p p' q f ' 

We may also write Jp = A&A~ l . In particular, these considera- 
tions apply when N and M are the same point-field. 

§ 2. Abstract Groups and their Realization 

An arbitrary number of transformations of a given point-field 
on to itself can be applied successively ; we are of course not 
restricted to merely two. But when we perform this process 
step by step it is automatically reduced to a succession of com- 
positions of transformations taken two at a time : 

ABC • • • = A[B(C • • •)]. 

This possibility of performing an extended composition in steps 
involving but two transformations at a time shows that the 
associative law 

(AB)C = A(BC) 

holds for any three transformations A, B, C. 
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The structure of a transformation group is obtained from it: 
by abstraction wh*en we allow the transformations themselves 
to degenerate into elements of an immaterial nature, retaining 
only their individuality and the rules in accordance with which 
two given transformations are composed, in a given order, to 
form a third. In accordance with what has been said such 
composition necessarily obeys the associative law. Perhaps it: 
also obeys other universal laws, but since we have at present: 
no indication of this we attempt a formulation of the abstract: 
structure of the group by means of the following definitions : 

An abstract group is a system of elements within which a law 
of composition is given such that by means of it there arises from 
any two [the same or different) elements a , b of the group , taken in 
this order , an element ba. The following conditions shall thereby 
be satisfied : 

1. The associative law c(ba) = (cb)a ; 

2. There shall exist an element I, the unit element , which leaver 
an arbitrary element a unaltered on composition with it : 

la = al = a. 

3. To each element a shall exist an inverse ar x which yields on 
composition with it the unit element I ; 

aa~ l == a~ x a = I. 

Such an abstract group is not to be confused with its reali- 
zation by transformations , i.e. by one-to-one correspondences of 
a given point-field. A realization consists in associating with 
each element a of the abstract group a transformation T(a) of 
point- field in such a way that to the composition of elements ojf 
the group corresponds composition of the associated transformer- 

tinw c * 

T(ba) = T{b)T(a). (2.1) 

It follows from this that to the unit element I corresponds the 
identity I and to inverse elements a , a™ 1 correspond inverse 
transformations : 

T{ar x ) = T~ x (a). (2.2) 

The first assertion follows from the particular case 

T(a)T(l) = T(a) 

of (2.1) by left-handed composition with the reciprocal of the 
transformation T{a) ; (2.2) is then contained in (2.1) as the 
particular case b = a -1 . The realization is said to be taithtiMM 
when to distinct elements of the group correspond distinct: 
transformations : 


T(a) 4 = T(b) when a 4= b. 
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In accordance with the fundamental equation (2.1) the necessary 
and sufficient condition for “faithfulness” is that T(a) shall be 
the identity only if a is the unit element. For if a } b are two 
elements of the group it then follows from T(a) = T(b), i.e. 

T{a)T-\b) = T(a)T(b~ l ) = T(ab ~ *) = I 

that under these conditions ab _1 = I, i.e. a = b. If the abstract 
group is obtained from a transformation group © by abstraction, 
then conversely © is a faithful realization of it. 

In the study of transformation groups we always deal with 
two manifolds, the structureless point-field and the manifold of 
group elements, the structure of which is expressed by the law 
of composition. The original problem thus resolves itself into 
two ; the examination of the various group structures possible 
and the examination of the possibility of obtaining realizations 
of the given abstract group by transformations of a given point- 
field. The historical development of the subject has shown that 
it is advantageous to effect this division into two problems ; 
they are of fundamentally different character and require 
fundamentally different mathematical equipment for their 
discussion. 

In accordance with our method of introducing the abstract 
group, which we henceforth refer to simply as the group, it 
serves merely to give the structure of the group ; the nature of 
its elements is immaterial. This abstraction from the nature 
of the elements is expressed mathematically by the concept of 
isomorphism. If we have two groups g, g' and there is as- 
sociated with each element a of g an element a' of g' in a one- 
to-one way : a', such that 

(i ba) f = Va\ (2.3) 

then the two groups are said to be simply isomorphic. Simply 
isomorphic abstract groups offer no means of distinguishing one 
from the other. The concept of isomorphism can, of course, be 
applied to transformation groups. Two isomorphic transforma- 
tion groups can be considered as faithful representations of 
one dnd the same abstract group. A group may be isomorphic 
with itself ; it is then said to be automorphic. Such an auto- 
morphism occurs when g and g' coincide, i.e. when a one-to-one 
reciprocal association satisfying the condition {2.3) is 

established between the elements of the group g. 

The question arises whether or not every abstract group 
possesses a faithful realization. If this were not the case the 
concept of an abstract group as developed above would be too 
broad — there would exist, in addition to the associative law, 
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other purely formal laws for the composition of transformations 
which are satisfied by every transformation group. Conversely, 
a proof of the realizability of any abstract group would tell us 
that all that can be said about the formal laws for the com- 
position of transformations is contained in our conditions (1) 
to (3). We can, in fact, construct a faithful realization of any 
abstract group g by taking as the point-field the group manifold 
itself and letting correspond to each element a of the group 
the transformation 

s s' — as 

of the group manifold on to itself. This “ left-translation ” 
ta is obviously a one-to-one reciprocal transformation which 
has as inverse the transformation s = ar l s\ If a and b are 
distinct elements the corresponding transformations t a , t& are 
distinct, for they allow the unit element I to correspond to the 
distinct elements a, b respectively. If we perform in succession 
two left-translations 

s-> s' = as y s' s" = bs' 

the resulting transformation is, in consequence of the associative 
law, 

s s" = b(as) =ss (ba)s. 

Consequently the left-translations constitute in fact a faithful 
realization of the abstract group. However, the right-trans- 
lations behave otherwise, for if we denote the mapping 
s-+s' = sa of the group manifold on itself by t*(a), we find 
instead of (2.1) the equation 

t *{ba) = t*(a)t*(b). 

§ 3. Sub-groups and Conjugate Classes 

A sub-group %' of a given abstract group g is a set of elements 
contained in g which itself fulfils the characteristic group con- 
ditions : the unit element I belongs to g', with a belongs also 
ar 1 and with a t b also ba. These three conditions can be reduced 
to the one : if a } b are any two elements of g', then ba~ x also 
belongs to g'. We assume, of course, that the partial system 
consists not merely of the element 1, but the other limiting 
case, in which g' coincides with g, shall be included under the 
concept of a sub-group. 

Examples are readily found. In the group of Euclidean 
motions are contained, for example, the group of rotations 
(which leaves one point, the centre, fixed) and the group of 
translations. The unitary transformations constitute a sub- 
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group of the complete group of all homogeneous linear transforma- 
tions ; the even permutations a sub-group of the group of all 
permutations. If we are dealing with a transformation group 
©, all those transformations of © which leave a particular 
point p fixed (i.e. which carry p over into itself) constitute a 
sub-group ©p. Instead of a point p the fixed element may be 
any figure composed of points ; the transformations of the sub- 
group must either leave the figure as a whole fixed (i.e. they must 
carry each point of the figure over into another such) or the 
more restrictive condition that they leave each point of the 
figure fixed. We can also obtain sub-groups of © by employing 
invariant functions instead of invariant figures. If if(p) is any 
function of position on the point-field with elements p we as- 
sociate with the transformation S : p p f the function i/j' 
defined by i/t'(p') = *l*(p) and say that it is obtained from i/j by 
the transformation S. If p' = Sp, p" = Tp\ the equations 

m = w) = ripi 

show that the composition of the transitions i/j p' and 
ifj r -> i//' associated with S and T result in the transition i/j i /»" 
associated with TS . Now consider all transformations S of © 
which carry ifj(p) over into itself, i.e. for which */j(Sp) = */j(p) is 
an identity in p ; they constitute a sub-group § of ©, and 
\fj{p) is an invariant of §. In this way we can separate out 
the rotations from the homogeneous linear transformations by 
requiring the invariance of the unit quadratic form. The sub- 
groups contained in a finite group g, which is described by 
exhibiting each of its elements and giving explicitly the result 
of composition of each two, can be obtained by inspection. 

There is associated with each element a of the group g a 
cyclic sub-group denoted by (a) : 

• • *, “~ 2 , «° = I, a, • • •, (3.1) 

the elements a n of which are defined inductively by the equations 
a 0 = I, a n+1 = a n a. 

These elements constitute in fact a group, for n and m being 
any integral exponents we have 

a n+m = a n a m t 

(a) is the smallest sub-group which contains a, i.e. its elements 
are common to all sub-groups of g which contain ( a. The 
elements of the set (3.1) can either be distinct or— -and this 
latter must be the case if g is a finite group — they must repeat 
themselves after a cycle of h terms : I, a } a?, • * *, a h ~ l are 
distinct but a h = I. h is called the order of the element a . 
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The order of a finite group is the number of its elements ; 
accordingly, the order of an element a agrees with the order of 
the cyclic sub-group (a) generated by a. A group is said to be 
commutative or Abelian if composition of its elements obeys 
the rule ba = ab. Cyclic groups are therefore Abelian. 

If a runs through the sub-group f) of g the associated (left-) 
translations t a constitute a group of transformations which is 
simply isomorphic with f), the point-field of which is the group 
manifold. We say that two elements s , s' which are equivalent 
with respect to this transformation group are (left-) equivalent 
with respect to f) and express this situation by the notation 
with respect to 1) ” ; the condition for it is that s' = as 
where a is an element of f ). In this way the elements of g are 
divided into sets of elements which are equivalent to f). If 
the number of such sets is finite, it is called the index of 1) in g. 
If g is a finite group the number of elements in each of these 
sets is given by the order of f ), for different translations t a send 
5 into different elements : as =)= bs if a 4= b. The order of f) is 
accordingly a divisor of the order of g, and the quotient of these two 
is the index of f ). 

The considerations at the end of § 2 above, which were 
developed for groups of transformations, suggest a second 
realization of the abstract group g. We associate with the 
element a the correspondence 

s->s' — asar x (3.2) 

of the group manifold on itself. This correspondence, which 
we call the “ conjugation ” I fl , is reciprocal one-to-one, and has 
as inverse s = a' 1 s' a. The law of composition is obeyed, for 
from 

s-+ s' = asa r\ s' -> s" = bs'b~ l 
we obtain the product 

s" = basar 1 b~ l = (ba)s(ba)“ 1 . 

Two elements s, $' of g are said to be conjugate if they are 
equivalent with respect to the group of all conjugations. Ac- 
cordingly, the whole group is divided into classes , any element 
of one of which is conjugate to any other element of the same 
class. When we speak of classes within a group without a 
more explicit description we mean these conjugate classes. 

The realization of g by the group of conjugations is in general 
a “ contracted ” rather than a faithful realization. In particular, 
the conjugation f fl coincides with the identity if a commutes 
with all elements s of the group. The totality of all such ele- 
ments a is called the central of the group ; it is obviously 
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an Abelian sub-group of g. But this disadvantage of con- 
jugation over translation is offset by an advantage ; conjugation 
is an isomorphic correspondence within the group itself which 
leaves the unit element invariant and which associates with 
each sub-group f) of g another such, the conjugate sub-group 
dfya~ l . These facts, which are expressed by the equation 

a(st)a~' 1 = (asar 1 ) (ata ~~ x ) , 

were already contained implicitly in the considerations at the 
end of § 1. f) is said to be a selUconjugate or invariant 
sub-group if it coincides with all its conjugate sub-groups. 

The importance of this last concept is best seen in the 
following : 

Theorem, //f) is an invariant subgroup and & denotes equiva- 
lence with respect to it , then it follows from 

s' s s, t' s t that s't ' = st. (3.3) 

To prove this we note that s' = as, t' = bt (a, b in f)) yield 

s't' = asbt = ( ac ) (st). (3.4) 

for c = sbs -1 belongs to f) with b. Since ac lies in f) our assertion 
is proven. It is readily seen that the invariantive nature of f) is 
necessary as well as sufficient for the validity of (3.3). In deal- 
ing with an invariant sub-group f) we need not distinguish 
between right and left equivalence with respect to f) — indeed, 
the above proof was based on this fact. 

We may, if we like, consider equivalent elements as not 
differing from one another (by application of the principle of 
definition by abstraction) ; but by thus allowing equivalent 
elements to fall together the group property of g is, in general, 
forfeited. In accordance with the above theorem it still remains, 
however, if f ) is an invariant sub-group. The group obtained 
from g by identifying all elements which are equivalent with 
respect to f) is called the factor group g / f) ; its order is the 
index of the invariant sub-group f) of g. 

These concepts are of assistance in examining the way in 
which a group may be “ contracted ” on setting up a realization. 
Let the transformation T(a ) of a given point-field on itself 
correspond to the element a of the abstract group g in the realiza- 
tion under consideration. Then T(a) = T(a') if and only if a' 
is obtained from a by composition with an element e (i.e. a' = ea) 
for which T(e ) is the identity. Such elements e obviously con- 
stitute a sub-group 1) of g, for it follows from 

T(e) = I, T(e') = I that T(ee') = T(e)T(e') =■ /. 
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is, in fact, an invariant sub-group, for if T(e) is the identity, 
the same is true of 

T(aea~ 1 ) = T(a)T(e)T^{a) = T(a)T“'(a). 

In any realization of an abstract group $ by a group of transforma- 
tions the elements of a certain invariant sub-group t) of g correspond 
to the identical transformation ; two different elements will be 
associated with the same transformation if and only if they are 
equivalent with respect to f). The group of transformations is 
consequently a faithful realization of the factor group q/ff. 

§ 4. Representation of Groups by Linear 
Transformations 

On requiring that the transformations which are to serve 
as a realization of a given abstract group g be linear and homo- 
geneous we arrive at a problem which is most fruitful from the 
mathematical standpoint and which is at the same time of 
greatest importance for quantum mechanics ; we then speak of 
a representation , instead of a realization, of the group.* An 
tt- dimensional representation of g, or a representation of degree n , 
consists in associating with each element s of the group an 
affine transformation U (s) of the ^-dimensional vector space 
9i = 9i n in such a way that these transformations obey the 
law of composition 

U(s)U(t) = U(st). (4.1) 

We then say that s induces the transformation U(s) in the 
representation space 91. On choosing a definite co-ordinate 
system in 9i each transformation U(s) is represented by a square 
matrix of n rows and columns, the determinant of which does 
not vanish. On replacing the original co-ordinate system by 
another, obtained from it by the transformation A } the corre- 
spondence which was formerly represented by the matrix U(s) 
is now represented by the matrix AU(s)A~\ Consequently if 
the association s^> U(s) is a representation, the association 

s “» AU[s)A ~’ x 

is obviously also one ; this latter representation is said to be 
equivalent to the former. They are essentially the same, 
differing only in the choice of the co-ordinate system in terms 
of which they are described. 

Examples . A representation in one dimension consists in 
assigning to each element 5 of the group a non-vanishing number 
X(s) in such a way that 

X(st) = x (s) x(0- 


(4.2) 
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In particular, x(I) = 1* A most trivial 1-dimensional repre- 
sentation is obtained by assigning to each s the number 1 : 
x(s) = 1. This special case is called the identical representation . 

Consider next the so-called symmetric group , the group 
77 = rrf of all /! permutations of / things. The association 

.s — > = -j- 1, 

according as 5 is an even or an odd permutation, defines a 
1 -dimensional representation, the 44 alternating ” representation 
of the group rt. For the character 8*, which distinguishes 
between the even and the odd permutations, satisfies the 
equation 

S 8 t = S* • 8/. 

Let g be a finite cyclical group of order h ; the elements 
s are then 

l, a, a\- • •, a*" 1 

and ah = I. Consider the 1 -dimensional representation %(s) 
in which x( a ) = The condition (4.2) for a representation 
then tells us that to the elements s of this series correspond 

1 p p2 ... e^—1 

X, C, C , , , 

and that to a h corresponds e h . Hence e A = 1 ; e must therefore 
be an h th root of unity and the law defining the representation 
is a r e r (r = 0, 1, 2, • * •). Conversely, when e is an arbitrary 
h ih root of unity this association defines a 1-dimensional re- 
presentation of g. We have thus obtained a complete survey 
of all possible 1-dimensional representations of a cyclical group. 

The only example of a multi- dimensional representation 
which we offer at this time is the following trivial one. If 
fj is itself a group of linear transformations of an w-dimensional 
vector space % then the association s -> 5- defines an w-dimensional 
representation of g. This example implies more than one might 
at first sight imagine. We have in fact to do the following : 
we first obtain the structure of the group g by abstraction from 
the group of linear transformations and then return to the 
original realization by means of the correspondence s -> s 
between an element s of the abstract group on the one hand 
and the linear transformation s on the other. 

The concept of equivalence has a more general significance 
than that discussed above. It may refer to an arbitrary system 
E of linear correspondences U of the n - dimensional vector 
space 91. We need not assume that these correspondences 
possess an inverse (i.e. that they have a non-vanishing deter- 
minant), nor need we assume that they are associated with 
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the elements s of a group, a$ is the case with representations. 
On expressing the set of correspondences U in terms of a new 
co-ordinate system each matrix U goes over into the matrix 
U f = AU A" 1 ; the system 27 is transformed into the equivalent 
system 27' consisting of the U\ A is here a fixed non-singular 
matrix. 

Consider a correspondence U of 91 on to itself. A linear 
sub-space SR' of SR is said to be invariant under U if the vectors 
of SR' are transformed into vectors of SR' by U. If SR' is invariant 
then the space SR (mod. SR') obtained by projecting SR with 
respect to SR' is also invariant (cf. I, § 2, in particular Fig. 1). 
SR' being invariant, U gives rise to a correspondence U' of SR' 
on to itself ; we say that U induces U' in SR'. Similarly for 
the space obtained by projection. We now pass from a single 
correspondence U to a system 27 of correspondences. SR' is 
said to be invariant under 27 if it is invariant under each corre- 
spondence U of 27. Describing SR in terms of a co-ordinate 
system which is adapted to the invariant sub-space SR', all 
matrices U of the system 27 reduce simultaneously to the form 
illustrated in Fig. 1, p. 8. 27 is called irreducible if SR con- 

tains no sub-space, other than SR itself and the space 0 consisting 
only of the vector 0, which is invariant under 27. We shall 
have occasion to reduce SR in such a way that each constituent 
separated off is irreducible under a given system 27. This 
requires the construction of a series of sub-spaces 

0,$Ri, SR 2 , - • •, SR r = SR, (4.3) 

beginning with 0 and ending with SR, in which each member 
is contained in the preceding one and is such that flti (mod. SR^i) 
is irreducible. Naturally SR* shall actually be larger than SR<_ lr 
not merely coincide with it. The implications of this reduction 
are most readily seen in terms of the matrices U of the corre- 
spondences of the system 27 on adapting the co-ordinate system 
to the “ composition series ” (4.3), i.e. by choosing first a co- 
ordinate system in SR*, then supplementing it with additional 
fundamental vectors in order to obtain a co-ordinate system 
for SR* SR 3 , • • • in turn. 

27 is said to be completely reducible if SR can be decomposed 
into two sub-spaces SR + 9R', each of which are invariant under 2? 
and such that neither of them consists merely of the vector 0. 
This concept of complete reducibility is more exacting than that 
of mere reducibility . On describing SR in terms of a co-ordinate 
system which is adapted to this decomposition, each matrix 
U of 27 assumes the form illustrated in Fig. 2, p. 9. We are 
then faced with the problem of decomposing SR (or 27) into 
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constituents, none of which is completely reducible, i.e. of 
decomposing 3FI = -f- • • * -f- 9t fc into invariant sub-spaces, 

none of which is completely reducible. 

We often find that reducibility implies complete reducibility, 
i.e. that in many cases we have the theorem : If 91' is an in- 
variant sub-space of $R, a second invariant sub-space SR" can 
be found such that SR is completely reducible (with respect to 
E) into SR' + SR". We shall soon see that this is actually the 
case when SR is a unitary space and E is a system of unitary 
transformations. 

It was shown in Chap. I, § 3, that if the system E is re- 
ducible, then the system E* of “ transposed ” correspondences 
of the dual space on itself is also reducible. If § : s U(s) 
is an n-dimensional representation of the group g the transposed 
U*(s) do not constitute a representation ; it is readily seen, 
however, that on employing instead the contragredient corre- 
spondences 

U(s) = [U*(s))-' 

we do obtain a representation s U(s) of the dual vector space. 
This we call the contragredient representation §. 

§ 5. Formal Processes. Clebsch-Gordan Series 

Continuous groups offer what are perhaps the simplest 
examples of the theory of representations. We consider in 
particular the group c = c n of all linear and homogeneous trans- 
formations 5 in n variables x J} x 2) ' * with non-vanishing 
determinants ; we consider each set of values x { as a vector 
in, an n-dimensional vector space t = t n . The classical theory 
of invariants, first developed in England about the middle of 
the last century, concerned itself in particular with the repre- 
sentations of c induced on the coefficients of arbitrary forms 
in the variables x { . A quadratic form in these variables is a 
linear combination of the n(n + l)/2 linearly independent 
products x { x k ; under the influence of a linear transformation 
s of the %i these products undergo a linear transformation [. s ] 2 , 
and the correspondence [s] 2 is obviously a representation 
[c ] 2 in n(n + l)/2 dimensions of the group c. The transformation 
^ of the variables x € sends the arbitrary quadratic form 

h x i 

into a quadratic form 

A A 
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in the new variables, where the coefficients a\ k are obtained 
from the a ik by a certain linear transformation s 2 associated with 
S] s t is obviously contragredient to [s] t . The quadratic form 
characterized by a fixed set of‘ n(n + l)/2 coefficients a ik may 
therefore be considered as a vector in a space of this number of 
dimensions, and the transformation s of the variables induces 
the transformation s$ in this space. The space thus defined by 
the totality of w-ary quadratic forms is thus the point-field for 
a group of linear homogeneous transformations which constitute 
a representation of the group c. 

We may in the same way deal with cubic, quartic, • • •, 
/- ic forms. The totality of monomials of order / are contained 
in the formula 

x{ x #£ * * • xfr (5.1) 

where the /,* are non-negative integers whose sum 

fi + ft + * * ’ +/*=/• 

They constitute the substratum of a representation [c]f in 
~nl __ n(n + 1) • • • [n + / — 1) 

If J 1*2 • • - J 

dimensions. 

But we can exhibit representations of c which are formally 
yet simpler than these arising from the theory of forms. Let 
(x { ) and (y t ) be two arbitrary vectors in our n-dimensional 
space r and consider the products x i y k . On subjecting the x i 
and the to the same transformation ^ of c (transition to a 
new co-ordinate system) the n 2 products undergo a certain 
linear transformation s X s associated with s and the corre- 
spondence s-> s X s is an n 2 - dimensional representation (c ) 2 of c. 
Now a system of numbers F(i, k), depending on two indices i, k 
which run through the values 1, 2, • • •, n, is said to be a tensor 
of second order if under the influence of a transformation $ of 
r the F(i , k) undergo the same transformation as the products 
Xi y k of the components of two arbitrary vectors £, of t. Hence 
the tensors of order 2 are the substratum of the representation 
(c ) 2 of c. (c ) 2 contains the representation [c ] 2 which is induced 
in the sub-space of symmetric tensors of order 2 ; the tensor 
with components Ffi, k) being symmetric if F(ik) = F(ki ). 

In geometry the antisymmetric tensors, i.e. tensors whose 
components satisfy the condition F[ik) = — F(ki) t play a more 
important role than the symmetric ones. - In particular, two 
arbitrary vectors (#,•), (y € ) define a surface element with 
components 

x{ik} = Xi y k - x k yi ; 
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of these quantities but n(n — 1) /2 are linearly independent, 
say those for which i < k. On subjecting the components x { 
of the vector J and the components y { of the vector t) to the 
same linear transformation s, the components of the surface 
element defined by them undergo an n[n — l)/2-dimensional 
linear transformation {s} 2 . 5 {^} 2 is a representation {c} 2 

whose substratum is the totality of anti-symmetric tensors of 
order 2. Hence the representation (c) 2 is reduced into the 
representations [c] 2 and {c} 2 , for any tensor F(ik) can obviously 
be written 

F(ik) = \{F(ik) + Flki)] + l[F(ik) - F(ki)l 

i.e. in a unique manner as the sum of its symmetric and anti- 
symmetric parts. That this reduction is correct is further borne 
out by the fact that the dimensionalities satisfy 

-.5 _ n i n + !) , »(« — !) 
n 2 h 2 ' 


Similarly three arbitrary vectors j, ty, $ determine a 3-dimen- 
sional element of volume with components 


x{ikl } = 


%i x* 

yi jk yi 

Zi Z k Zi 


(5.2) 


These elements constitute the substratum of a representation 
{c } 3 in 

(n\ __ n(n — l)(n — • 2) 

\3J" 1-2-3 


dimensions. Continuing in this way we can construct 4-, 
5-, * * *, w-dimensional elements ; this process must cease with 
w-rowed determinants, for a determinant of the form (5.2) with 
more than n rows must necessarily vanish identically. 

We shall see that the representations of c whose substrata 
are the symmetric and anti-symmetric tensors of order / are 
irreducible, and shall in fact solve the general problem of effect- 
ing the complete reductions of (c)-f, the representation induced 
by c in the space of all tensors of order /, into its irreducible 
constituents (Chap. V). 

The tensor concept really depends on the X -multiplication 
introduced in II, § 10. If the m variables undergo a trans- 
formation A and the n variables y k a transformation B } then 
the win products x$ k undergo a transformation A X B. Con- 
sidering the Xi as the components of an arbitrary vector £ in 
an w-dimensional space 5R ru and the y k as the components of 
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ty in 9t n , the products x t • y k may be considered as the components 
of a vector j X t) in an mn-dimensional vector space 9 X 9t n . 
Hence two representations 

$:s^U(s), &:s-> U'(s ) (5.3) 

of g in m } n- dimensions, respectively, give rise to a new nm- 
dimensional representation which we denote by ip X ip' : 

E7($) X C7'(*). (5.4) 

This presents a general method of obtaining a new representa- 
tion § X from two given representations $, ip'. 

Denoting the representation s -w of the linear group c for 
the moment by (c), the representations of c whose substrata 
are the tensors of order 2, 3, • • • are then (c) X (c) = (c) 2 , 
(c) X (c) X (c) = (c) 3 , 

We should, perhaps, have discussed the addition + of two 
representations before discussing their multiplication X . Con- 
sider the variables x { and y k as the components of a single vector 
} in an (m -+- n) -dimensional vector space ; when the x { are 
subjected to the transformation A and the y k to the trans- 
formation B these m + n variables undergo a certain trans- 
formation ( A , B). Hence we obtain from (5.3) the representation 

§Xfd’:s^[U(s), U'(s )] 

in m + n dimensions. The inverse of this process is complete 
reduction, as discussed above : ip + & * s completely reducible 
into the components ip and ip'. 

Another important formal method is the following : Any 
representation r in iV- dimensions of the linear group c n in 
n-dimensions may be used to construct an iV-dimensional 
representation of any abstract group g from an rc-dimensional 
representation ip of the same. J 1 associates with the linear 
transformation u in n-dimensional space a linear transformation 
U in N dimensions, so if ip : s -* u is an w-dimensional repre- 
sentation of the group g with elements s, then 

s ^ u-> U 

is an N - dimensional representation s-+ U of g which we may 
denote by -T(ip). To this is due the importance of the repre- 
sentations of the linear group for the general theory of repre- 
sentations. For example, take JH to be the representation of 
c whose substratum is the dual space, the space of all tensors 
of order 2, of the symmetric or anti-symmetric tensors of order 2, 
etc. ; we then obtain from the representation § of the abstract 

group g the representation Jp, ip X ip, [$ X $]. {$ X §}, etc. 
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The three most important formal processes are (1) addition, 
(2) X -multiplication, and (3) the JH process. The first two 
generate a new representation from one or two given repre- 
sentations, the third a new one from a given representation. 
The first two are completely circumscribed, but the third 
contains a general method, for F may be any representation of 
the linear group c n . 

If g' is a sub-group of g, then any representation 
Q : s U(s) of g contains a representation of g' ; we need only 
let the element s run through the sub-group g' ! This too may 
be considered as a formal process (4) which generates a repre- 
sentation of g' from a given representation of g. 

The X -multiplication occurs in yet another connection. 
Given two groups g, g', we can consider the pairs (s, s'), the 
first member s of which is an element of g and the second s' 
an element of g', as the elements of a new group g X g', the 
direct product of g and g', obeying the multiplication law 

(S, s')(t, 0 = (st, s't'). 

The order of g X g' is the product of the orders of g and g'. If 
<q:s-+U(s) is an w-dimensional representation of g and 
: s' -> U’(s’) an n'-dimensional representation of g', then 

(. s , s') -* U(s) X U'(s') (5.5) 

is obviously a representation in nn' dimensions of the group 
g x g' ; we denote it by § X (with a boldface X). This 
construction may be broken up into two steps. First introduce 
the representation 

(s, s') -> U(s) 

of g X g' ; there is no reason why we should not designate it 
by the same letter § as the representation j -> U(s) of g — we are 
accustomed to calling the function f(x), considered as a function 
of the two variables x, y, by the same letter as the function 
f(x ) of the single variable *. U(s) and U'(s') are thus to be 
considered as functions of the same variable pair (s, s ), and then 
the representation <p X §' of g X g' may be obtained by ordinary 
X -multiplication from § and §'. The differentiation between 
boldface X and ordinary X is accordingly purely pedantic. 

Examples. Unimodular Group in Two Dimensions 
Let g = c = c a consist of all linear transformations s of two 
variables x, y : 

x' — ax by, y' =■ cx ■+ dy , (5.6) 
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whose determinant ad — be ~ 1 (“ unimodular ” linear trans- 
formations *). A homogeneous polynomial in x t y of order / is 
a linear combination of the f + 1 monomials 

xf" 1 y, • • *, # yf” 1 , y f . (5.7) 

Under the influence of s they undergo a linear transformation 
which we denoted above by [s]f ; they constitute the substratum 
of a representation [c]f :s-> [s]f in /+ 1 dimensions which we 
now denote by ©/. ©/ is, although we have yet to prove it, 
irreducible. 

We can restrict ourselves within c to the sub-group c x of 
“ principal '* transformations which transform each of the 
variables separately : 

x'±=ax, y’ = \y, (5.8) 

where a =|= 0 is an arbitrary constant. C x is Abelian, This 
transformation multiplies the monomials of the set (5.7) by 

af , af~ 2 , • • •, <ar(/~ 2 ), arL 

On associating the number a r with the element (5.8) of c x we 
obtain a 1-dimensional representation which we denote for the 
moment by ©< r > ; here r can be any fixed integral exponent. 
We have just seen that the irreducible representation Sy of c* 
is completely reduced on restricting ourselves to the sub-group 
c x into /+ 1 one- dimensional representations ©< r > with r = f } 
f — 2, • • This is an example of the process (4). 

As an example of multiplication and addition we consider 
the problem of reducing the product G/ X ©^ of the two repre- 
sentations ©/, Gg of c into its irreducible components. The 
result is contained in the formula 

X = (5,9) 

where v runs through the series 

«=/+g>/+g- 2 . ‘ • | /~g| 1 (B-M) 

without repetition, decreasing by 2 from term to term. This 
equation is essentially identical with the Clebsch-Gordan series 
which plays such an important r61e in the theory of invariants 
of binary forms. We shall see in the succeeding chapters that 
it may justly be considered as the fundamental mathematical 

* c n will usually denote the group of all non-singular linear transformations 
in tt-dimensions ; it will however occasionally be used to denote the more 
restricted unimodular group, in which case the restriction will be explicitly 
stated. 
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formula for the classification of atomic spectra and for the theory 
of the valence bond. 

The proof consists in showing that 

«/ x = « /+ , + (©/_! X (5.11) 

for (5.9) then follows by mathematical induction and the fact 
that obviously 

A new co-ordinate system for the representation space of S/ 
is obtained by replacing the basis (5.7) of homogeneous poly- 
nomials of order / by another basis. In this sense we can say 
that the polynomials of order / constitute the substratum of' 
the representation The substratum of the representation 
©/ X & g is th$n the totality of polynomials 

<p = ${%y ; (vj) 

depending on the components of two arbitrary vectors [xy) } 
(£rj), homogeneous and of order /in the first, and homogeneous 
and of order g in the second ; we write the total order /+ g = h. 
The 0 are thus linear combinations of the (/ + l)(g + 1) 
monomials 

x { yk . ^ w here i + k = /, i + k = g. (5.12) 

Both vectors are transformed cogrediently under the same trans- 
formation (5.6). The problem consists in completely reducing 
the space of the polynomials 0 into two sub-spaces (0) o and 
(&)' which are the substrata of the representations S* and 
«/-i X respectively. We first discuss the structure of 
these two sub-spaces. 

($)o. Expand 

(ocx + f}yy(ct£ + f}rj)9 = <x h '</><> + -f • • • + P h '<f>h 

(5.13) 

in powers of the undetermined coefficients a, /?. The 
<t>i = <f>i(xy ; £rj) are special polynomials of the type 0 and span 
the sub-space (0) o . We must now show that this sub-space is 
invariant under the transformation (5.6) of the variables ; 
i.e. that fa = <f>i{x'y r ; £V) is a linear combination of the 
<f>t = (j>i[xy ; £ 77 ). It is clear that if this is the case then c in- 
duces the representation ©a in (0) O) for on identifying the two 
vectors 


<f>i becomes 


i = x, t) = y 

H x y ; x y) = xK ~ { * y‘- 


(5.14) 
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Hence we are certain a priori that the h -f- 1 functions <f> { are 
linearly independent. 

In order to arrive at the desired proof we replace x, y in 
(5.13) by 

%' = ax + &y, y f = cx + dy , 
and in the same way f, rj by 

f ' = ai; + br) } rj' = c£ + drj. 

Now note that cnx f + /3 y' is the linear form 
(oc# -J- fic'jx -j- (oc^ fid^y === 
in # and y ; hence 

{ax’ + /J/)/(a£' + /fy')* 7 = -f ByY(A ij + Brjjs, 
and by (5.13) 

z ({)«*-?' 

On replacing A, B on the right-hand side of this equation by 
A = ocd -f- f$c } B = ct.b -j- fid, 

and equating coefficients of we obtain <f>[ as a linear 

combination of the (f> k . 

(0)'. The substratum of the representation x 

consists of the polynomials 

£?) _ 

of order/ — 1 in ( x , y) and of order g — 1 in (f, rj). They are not 
polynomials of type 0 ; in order to increase the order in the 
components of each vector by 1 we replace each such by 

0 = (xr) - yi ) • V. 

The factor thus introduced in no way affects the representation. 

The last step in the proof consists in showing that the total 
space of polynomials 0 is completely reducible into these two 
sub-spaces ; i.e. in showing that any polynomial 0 can be 
written in the form 

0 = ( a 0 <f> 0 + + • • • + ah<f>h) + (xt) — yi;)*? (5.15) 

with unique constant coefficients a { . (The development in 
terms of powers of the determinant xrj — yf obtained from this 
by induction is the Clebsch-Gordan series.) First, the dimen- 
sionalities are correct, for 

(f+ l )(g + 1 ) = (/+ g + 1 ) + fg. 

Hence it^ suffices to show that the various terms in (5.15) are 
linearly independent, i.e. that an expression of the form (5.15), 
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in which W is a polynomial of order/— 1 in (x, y) and of order 
g — 1 in (£, rj ), can vanish only if W vanishes identically and if 
all the coefficients a { are zero. The proof is extremely simple. 
We first let (£rj) — (xy) as in (5.14), then the equation 0 = 0 
becomes 

atfc h + a-iX^y + * • * + Hy h = 0 

identically in x and y ; hence = 0. Having established this 
we return to the two sets of variables xy ; t and obtain the 
equation 

(xr) — y£)W = 0, 

from which it follows that W = 0 — in an algebraic identity 
for polynomials we may always remove a factor, such as 
xrj — y £ } which does not vanish identically. 

Our formula (5.9) also holds for the group C of all linear 
transformations of x, y with non-vanishing determinant. We 
must then interpret © v , v = h — 21 in (5.9) as that representation 
whose substratum is the totality of homogeneous polynomials 
of order v in % and y multiplied by (xrj — yg). In other words, 
the new ©*, differs from the old in that the transformation of 
the (v + 1) -dimensional representation space corresponding to 
s in the representation (£ y is to be multiplied by the Z th power 
of the determinant ad — be. 

(£/ X $ g is a representation of c 2 X c 2 , the group consisting 
of pairs (s, s') whose members s and s' run independently through 
the entire group c 2 . On introducing the restriction that s' is 
the element 5 obtained from s by replacing the coefficients of 
the linear transformation 5 by their conjugate complex,©/ X 
becomes a representation ©/, g of c 2 , the substratum of which 
may be taken as the monomials 

x iyk . x L y K (i + k = /, i + * — g) 

of order / in (x } y) and order g in (x } y). It can be shown that 
«/. g is also irreducible. 

§ 6. The Jordan-Holder Theorem and its Analogues 

Perhaps the most fundamental theorem of mathematics is 
that on which the concept of cardinal numbers depends. Let 
the members of a finite set of objects distinguished by marks 
a, b, c • * ‘be exhibited individually in this order and associated 
with the symbols 1, 2, • • • n. The theorem then states that 
the u number ” n is independent of the order in which the 
objects are exhibited. The proof of this theorem is of con- 
siderable mathematical interest and offers the simplest example 
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of the type of proof employed in establishing the Jordan- 
Holder theorem. A new enumeration consists in associating 
the symbol 1 with any one of the objects, the symbol 2 with 
any one of the remaining objects, etc., until the entire set is 
exhausted, the last object receiving the symbol n'. We assert 
that n' = n. 

The proof is divided into two steps. (1) If in the new enumer- 
ation the symbol 1 is associated with the same object a as in 
the old, our theorem for the series from 1 to n is reduced to that 
for the series from 1 to n — 1. This is immediately evident on 
discarding the object a and reducing by one the symbols as- 
sociated with the objects ft, c, • • • in the new as well as in the 
old enumeration. (2) If, on the other hand, the symbol 1 is 
associated with one of the other objects b, c, * • • then in the 
new enumeration the object a is associated with some symbol 
i contained in the series 2, 3, • • •, n'. We now introduce a 
third enumeration which enables us to make the transition 
between the first and the second by interchanging the symbols 
1 and i in the second enumeration. The number n' is obviously 
unaltered by this process. But we have now introduced an 
equivalent enumeration in which the object a is associated with 
the same syfnbol 1 as in the original and have reduced the 
general case to the one considered in (1) above. The proof of 
the theorem then follows immediately by the method of 
mathematical induction. 

As an auxiliary result of these fundamental considerations 
we have the theorem that any permutation can be obtained by 
the successive application of transpositions. 

The Jordan-Holder theorem is concerned with an abstract 
group g. An invariant sub-group g' of g which does not coincide 
with g itself is said to be maximal if there exists no invariant 
sub-group of g — except g' and g — containing g'. The factor 
group g/g' is then simple, i.e. it contains no invariant sub-group 
with the exception of itself and that consisting only of the 
unit element I. As was recognized by Galois , the so-called 
composition series 

9<> = 9, 9n 92, • • 9r-l, gr = I (6.1) 

is of fundamental importance for the solution of algebraic 
equations. This series begins with g and ends with I , and each 
member is a maximal invariant sub-group of the preceding 
member. We assume that the composition series terminates ; 
this is naturally the case for finite groups, as the order necessarily 
decreases from term to term. The successive factor groups 

9/01. 0l/02> * • 0r-l/0r = 0r— 1 (6-2) 
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are simple. The Jordan-Holder theorem asserts that the 
structure of these factor groups , except for the order in which they 
appear , is uniquely determined by g. 

Consider, therefore, a second composition series 

So 3> 9i> 02) 

of the same group g ; it is to be compared with the “ standard 
series ” (6.1). The proof of the fact that this new series also 
contains exactly r + 1 terms and that the corresponding factor 
groups are, except for the order in which they occur, isomorphic 
with the factor groups (6.2) is again accomplished in two steps. 

(1) If the two second members g' 1} g x coincide, the theorem 
for the group g, whose standard series contains r + 1 members, 
is reduced to the corresponding theorem for the group g x , whose 
standard series contains but r members. 

(2) If g x and g' x do not coincide we construct the inter- 
section f) of g x and g' x , i.e. the set consisting of all elements 
common to the two. f) is then an invariant sub-group of g' x 
and, as we shall prove, g' x /f) is isomorphic with g/g x . That 
two elements s, t of g are equivalent with respect to g x , i.e. that 
they belong to the same “ set,” is expressed by the equation 
t = a x s where a x is in g x . If s and t are at the same time elements 
of the sub-group g' x , then a x is also in g' x and consequently it 
is an eletnent of f). We may therefore consider as the elements 
of g'i/f) those sets in g which contain an element of g' x . The 
elements contained in these classes then constitute an invariant 
sub-group § of g containing both g x and g' x , and g' x /f) is simply 
isomorphic with §/g x . But since g' x is maximal either ig == g 
or ip = g' x . The second case implies that g x is contained in g' x , 
and since it is maximal it must coincide with g' x , contrary to 
assumption. Hence £) coincides with g and our assertion is 
proved. The intersection f) of g x and g' x depends symmetrically 
on both, whence g/g' x and g x /f) are also simply isomorphic. 

We now proceed as follows. We construct a composition 
series for f), which we denote simply by f), • • •, and coriipare 
the following four composition series of g : 

9, Bi> 92) 

9> Si) \ 

9, 8 it I, ' • ' 

9, 8'x, 9'* ’ • ‘ 

The comparison of the first and second series is reduced to case (1). 
The second and third series agree from the member!) on, and 
the two foregoing factor groups 

9/8i, 3i/f> 



134 GROUPS AND THEIR REPRESENTATIONS 


are, as we have seen, simply isomorphic with 


0/9'x, fl'i ft 


on interchanging their order. The comparison between the 
third and fourth series is again reduced to the case (1). The 
proof of the theorem for composition series containing r + 1 
members is thus reduced to the proof of the corresponding 
theorem for series with but r members, and since it obviously 
holds for r = 2 (i.e. for simple groups) the method of mathe- 
matical induction establishes its general validity. 

The close methodological agreement between the construction 
involved in the proof of this theorem and that involved in the 
proof of the independence of the cardinal number of a set of 
the order in which the objects are enumerated is immediately 
evident. 

E. Noether 4 has given a generalization of the Jordan-Hblder 
theorem which is of importance for us. A correspondence 
s-+ s' = As oi the group on itself is said to be automorphic if 
multiplication is invariant under it, i.e. if (st)' = s't' — we here 
neither assume that different elements $ generate different 
elements s' nor that for a given element 5 ' there exists an element 
5 such that s s' in virtue of the automorphism. Let 2 be 
a system of such automorphic correspondences of g. We now 
admit only sub-groups of g which are invariant under 2 } i.e. 
sub-groups whose elements are carried over by all operations 
of the system 2 into elements of the same sub-group. We say 
that two such “ allowed ” sub-groups and g 2 have the same 
structure if we can set up a one-to-one simple isomorphic 
correspondence between the elements of the one and the ele- 
ments of the other in such a way that every operation A of 
the system 2 sends corresponding elements of the two sub- 
groups over into corresponding elements. The Jordan-Httlder 
theorem still holds under this modification ; its proof can be 
aken over unaltered. 


The vectors of an n- dimensional vector space 5ft constitute 
an Abelian group whose multiplication is the addition + of 
vectors. We must for the moment supplement addition by 
the operation of multiplication of a vector by an arbitrary 
number ; hence the concepts and theorems applying to vector 

S ? a Ak a r e n0t truly s P ecialization s of the concepts and theorems 
of Abelian groups, but there exists a thorough-going analogy 

t£TS th ^ ; In dicating this analogy between a group (on 

} V6Ct0r Sp , ace (on the right ) b y ~ we have, for ex- 

cS£oon U d b e nr r r P T^ *7™ , Sub ' Space ’ automorphism ~ linear 
orrespondence. Indeed, a linear sub-space is a system {ft' of 
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vectors such that with £ and t) their sum g + tj and the product 
Aj by an arbitrary number A also belong to SR', and a corre- 
spondence j -> £' = A% is linear if it sends J + t) and Aj over into 
l' + t}' and Aj', respectively. Every “ sub-group ” is here 
invariant, as we are dealing with Abelian groups. If 9t' is 
a sub-space of 9i the space 91 (mod. 91') obtained by projecting 
9* with respect to 91' is the exact analogue of a factor group. 
A composition series consists of a sequence of spaces each 
member of which is a linear sub-space of the preceding one 
and has one less dimension. The last member is the space 0, 
consisting of the vector 0 alone, and the number of members in 
the series is 1. greater than the dimensionality w. The Jordan- 
Hdlder theorem is here valid but trivial. 

On the other hand, this theorem is of considerable importance 
on going over to Noether’s generalization. Consider a system 
E of linear correspondences of the vector space 91 on itself ; the 
terms invariant, equivalent, reduction shall in the following refer 
to this system. Two invariant sub-spaces 9t x and 91 2 are similar 
or equivalent if a one-to-one linear correspondence can 

be set up between the vectors of the one and the vectors of the 
other in such a way that any operation A of the system sends 
corresponding vectors over into corresponding vectors. On 
reading the series (4.3) established in § 4 backwards, we have 
the exact analogue of the composition series : each member of 
the series is followed by a maximal sub-space which is invariant 
under E. (The possibility of constructing the composition 
series in increasing as well as decreasing order is due to the 
fact that the addition of vectors is commutative.) Furthermore, 
we can obtain the concepts and theorems relating to a system 
E of correspondences as genuine special cases of those of group 
theory, and not merely as analogues, by supplementing the 
system E with all similarity transformations, i.e. by all corre- 
spondences of the form £ £' = Aj representing multiplication 

by an arbitrary number A. The Jordan-Hfjlder-Noether theorem 
now states : Given a second composition series 

o, 94 94 • • 9ft, (6.3) 


the corresponding projection spaces 

94 % (mod. 94, 9*3 (mod. 94, • • * 
are equivalent to the projection spaces (4.3) 

34 % (mod. 94, 9t 3 (mod. 34, • • * 
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of the original series, taken in a suitable order. The number 
of members is, of course, the same in both. The reader is 
advised to reconstruct the proof of this theorem by carrying 
through the proof of the Jordan-Holder theorem step by step 
for this case. 

In particular, if the system E consists of the transformations 
U(s) associated with the various elements s of a group in a 
representation § : $ U(s), our result yields the 

Uniqueness theorem : The irreducible representations separated 
off from § by successive reduction are completely determined by 
except for the order in which they occur , considering equivalent 
representations as the same . In particular , the complete reduction 
of § into irreducible components is unique , always considering 
equivalent representations as the same . 

§ 7. Unitary Representations 

For the case in which the representation space 9ft is unitary 
and the correspondences U(s) of 9ft on itself, associated with 
the element s of the group under consideration, are also unitary, 
certain of the concepts introduced above are to be modified 
accordingly. Two representations 

* -* 17(f), s U'(s) = AU(s)A~\ 

are to be considered as equivalent only if A is unitary, i.e. if it 
is a transformation from one normal co-ordinate system in 
9ft to another such. If 9t' is a sub-space of 9ft a unitary-orthog- 
onal co-ordinate system can be set up in 9ft' and supplemented 
by additional fundamental vectors to form a complete unitary- 
orthogonal co-ordinate system for the entire space 9 1 : every 
sub -space of a unitary space is per se unitary. Invariance and 
reduction remain as before, but we allow only those decom- 
positions of 9ft into two sub-spaces 9t 2 + 9ft 2 in which 9ft x , 9ft a 
are perpendicular. For a system of unitary correspondences 
reducibility implies complete reducibility and we have the theorem : 
If 9ft' is invariant with respect to E then 9ft may be broken up into 
9ft' + 9t" in such a way that 9ft" is also invariant under E. We 
need merely t o define 9fi" as the space defined by all vectors per- 
pendicular to 9t'. The theorem naturally holds for the case in 
which E is a system of infinitesimal unitary correspondences or, 
what amounts to the same, a system of Hermitian forms. The 
theorem developed in the preceding section proves that these 
irreducible components are uniquely determined, in the sense 
of (unitary) equivalence, to within a permutation. 
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Examples 


(1) The Unitary Group in Two Dimensions 

The group c = C 2 of linear transformations in two dimensions 
contains the sub-group u = u 2 of unitary transformations. 
Hence the representation ©/ of c obtained in § 5 is also a repre- 
sentation of u. This representation is not unitary as it stands, 
but it can readily be made unitary by a slight change. The 
transformation of ©/ corresponding to the unitary transforma- 
tion s of the co-ordinates x, y is that induced by s on the monomials 

x n = xy k (i + k=f) (7.1) 

of order /. For purposes of symmetry we label these co-ordinates 
with the index n — i — k which runs through the values 
/, / — 2, • • -, — /. This is also desirable because on restricting 
ourselves to the sub-group of “ principal transformations ” 

1 

x-^ ex, y - y 


x n is multiplied by the factor e n . We now employ, instead of 
(7.1), the variables 


x l y h 

Vi ! k ! 


(7.2) 


obtained from them by multiplication with a constant. The 
representation ©/ of u will then be unitary, as follows from the 
equation 


/! 


(xx + yy)f = Z 


% iyk , 


fiiyk 


Hkl 




We call ©/ even or odd according as /is even or odd. The even 
representations associate the identity 1 with the reflection 


x' = — x t y f — — y, 

and the odd associate with it the transformation — 1. ©/is 
also irreducible when considered as a representation of u, and 
on letting / assume the values 0, 1, 2, • • • they form a complete 
system of inequivalent irreducible representations of u. The proof 
of these assertions, which we employ heuristically in the follow- 
ing, will be given in Chapter V. On writing a homogeneous 
polynomial of order / in the variables x, y in the form 


E d n x n 

the coefficients a n transform under the influence of a unitary 
transformation 5 like the components of a vector in the repre- 
sentation space of ©/. 
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The complete reduction 

■was accomplished by breaking up the space of the “ polynomials 
” into two invariant sub-spaces (0) o and (&)'. We must 
now verify that these two sub -spaces are mutually orthogonal 
in the unitary sense. A general polynomial €> may be written 




= g, g “ 2, 



where the x n are given by (7.2) and the are the corresponding 
monomials 


_ |Y 


(* + * = g, L — /c = v). 


Two such polynomials <J> with coefficients a n „, are orthogonal 
if 

jE&nv ^nv ^ 

The polynomial whose highest coefficients a/ 9 = 1 while 

all others vanish, is to within a constant factor w • £0 and is 
obviously perpendicular to all polynomials (&)', for in all these 
latter the coefficient of xf& vanishes. But under the unitary 
transformation 

s : x' == ooc + fiy, y' = — fix + ay, (7.3) 

where oca + fifi = 1, goes into 

(<xx + Py)f (a£ + /Jij)*. (7.4) 

Since (&)' and the orthogonality of polynomials are both in- 
variant under the unitary transformation s, (7.4) is also orthog- 
onal to (#)' and, with the help of the definition (5.12) of (0) a , 
it follows from this that all polynomials of (#) 0 are unitary- 
orthogonal to those of ($>)'. 

(7.3) is the most general unimodular unitary transformation. 
This is derived in the same way as the familiar formula for the 
orthogonal transformations of two variables with unit deter- 
minant in plane analytical geometry. On writing the coefficients 

a = #c + A, j8 = - /x + iv (7.5) 

in terms of their real and imaginary parts we see that each such 
transformation is characterized by four real parameters k, A, /a, 
the sum of whose squares is 1. The composition of two trans- 
formations s : (*, A, fi , v) is accomplished in terms of these 
parameters by Hamilton's quaternion multiplication ; this latter 
led to the vector calculus. 
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(2) Unitary Groups in n-Dimensions 


The totality of tensors of order / is the .substratum of an 
trf - dimensional unitary representation (u)/ of the group it = u w , 
for on denoting the components of an arbitrary tensor by 
F(i x i 2 * • • if) the sum 


E \F{iih 

(h> ' * *, if) 


v)l 2 


(7.6) 


is a unitary invariant. On restricting ourselves to the 

-dimensional linear manifold of anti-symmetric tensors we 

take as the variables in tensor space those components 
P{i\H * * * if) for which i x < i 2 < * • • <if. The sum (7.6) 
for these components only is, however, equal to the complete 
sum (7.6) divided by /!; hence the representation {u}^ of u, 
whose substratum consists of all anti-symmetric tensors, is 
unitary. The situation is somewhat different for symmetric 
tensors. The most general symmetric tensor of order / trans- 
forms like J X f X • • • X { (/ terms), i.e. we may for the 
present purpose set 



F (hh ‘ • • if) = K *;,••• x if . (7.7) 

We write the monomial on the right in the form 

x{ 1 x{' • • • %l n (5.1) 

as before ; f r is the number of times the index r appears in the 
series i 1} i 2 , • * •, if. In this sense we write the components of 
a symmetrical tensor 


F(hH • •. • if) == ft, • • •, /„). 


The sum (7.6) becomes in this case 


E jT J jr f7~ .fj Wi’f* ■ • 

extended over all integral f r ^ 0 for which f x + / 2 + * • • +/«==/• 
The coefficient indicates how often the term \F(i x i 2 • * • i/)\ 2 
occurs in the sum in consequence of the fact that its value is 
unchanged on permuting the indices. We must therefore 
consider the quantities 

^(/l> A? * * *> fn) 

VAlfti •••/»! 


as independent components of an arbitrary symmetric tensor 
of order / in order to obtain a unitary representation [u]A 
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The truth of this assertion follows from the fact that the special 
tensor (7.7) satisfies the equation 


jf * A + 


* + x n x n y = S 


A" 


rfn . xfi , 


A 


■fnl 


(7.8) 


We have already seen in I, § 5 that a normal co-ordinate 
system can be so chosen that a commutative system E of 
unitary correspondences is completely reduced to a set of 
1- dimensional systems. The only irreducible unitary repre- 
sentations of an Abelian group are accordingly 1- dimensional. 
For it follows from 

U(s)U(f)= U(st) (4.1) 

and the Abelian character of the group that the unitary matrices 
U(s) associated with the elements s are commutative. 

If § and are unitary representations, then § + 

§ X are also. — The first fundamental problem for a given 
group g is to find a complete system of inequivalent irreducible 
unitary representations of g, for then any unitary representa- 
tion of g can be obtained by the addition of these irreducible 
representations. The second fundamental problem is to reduce 
the product X $q' of two irreducible representations ip, of g 
into its irreducible components ; or better (after having solved the 
first problem), to determine how often each of the irreducible 
representations occurs in this product . 

We illustrate these problems on the example offered by 
rotation groups, which are of particular importance in quantum 
physics. 


§ 8. Rotation and Lorentz Groups 

(a) The Group of Rotations in the Plane 

We describe the 2-dimensional plane by a complex co- 
ordinate x , The rotations of the plane are then given by 

x-+x' = ex, (8.1) 

where s — e*+ is a constant with unit modulus. (The rotations 
of .the real 2-dimensional plane thus coincide with the unitary 
transformations of a single complex variable.) The angle of 
rotation <f> determines the rotation completely, but it is of course 
only determined mod. 2n by the rotation. The angle of rotation 
behaves additively on composition : the rotation <f> followed by 
the rotation <f>' results in the rotation </> + <f>'. This rotation 
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group is accordingly a one-parameter continuous Abelian group. 
We obtain a 1-dimensional representation of our rotation 

group b = b 2 by associating with the element e, (8.1), the linear 
correspondence 

* -> x' — e m • x = e im * • #, (8.2) 

where tn is any fixed integer. I assert that the ®< m >, m running 
through all integral values, constitute a complete system of 
irreducible unitary representations of b 2 . This can be seen as 
follows. 

Any irreducible representation is necessarily 1-dimensional : 
it associates with the rotation <f> a number x{<f>) absolute value 
1 such that 

x{<f> + <t>') = xii>) • xit')- 

We assume that our representation is continuous ; then x(<A) 
is a continuous function of <j> with period 2n. First, x(0) = 1 • 
We write x{<f>) = and determine A (<f>) uniquely by the require- 
ments that A(0) = 0 and that X(<f>) shall be a continuous function 
of <}>. We then have 

X(<f> + <f>') — M4>) + A(f), (8.3) 

for the right- and left-band sides of this equation could at most 
differ by an integral multiple of 2 tt, but as it is written both 
sides agree for <f>' — 0 and vary continuously with <f>’. (8.3) 

satisfies the condition A(0) — 0 and we obtain from it the further 
equations 

A(— <f>) =— X(hf,)=h-X(<f>), (8.4) 

where h is any integer. On replacing <f> in the second of these 
equations by <f>lh we obtain 

*(!)->■ (8 - 5) 

It follows immediately from (8.4), (8.5) that for every rational 
number kjh ( k , h integers) 

In accordance with our assumptions A(27 t) is an integral multiple 
2 w 7 r of 27r. On setting <f> — 2 tt in (8.6) we obtain the equation 
A(<£) = m<f> for all </> which are rational fractions of 2 t r ; the 
continuity requirement then allows us to assert its validity 
for all real values of the argument <f>. 
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The simple equation 

$<"*> X ® (m,) = ®< OT+m '> 

is here valid. 

Consider the function f(p) on the unit circle in the complex 
x plane. If the point p goes over into the point p' under the 
rotation e, the function / goes into a function f which is defined 
by the equation 

fiP') = /(/>)• 

The transition /->■ f is a linear correspondence in the oo-dimen- 
sional space of functions f(p) and is associated with the rotation 
e ; this obviously defines an oo- dimensional representation of 
the rotation group b 2 , which we denote by $. $ is unitary if 

we take as the square of the absolute value of a “ vector ” / 
the integral of |/(/>)| 2 with respect to the element of arc dp on 
the unit circle. The fact that any function (satisfying suitable 
conditions) on the unit circle can be developed in a Fourier 
series means that in the reduction of $ into its irreducible com- 
ponents each of the 1-dimensional representations 2)( m > occurs 
once and only once. More precisely, this reduction is to be inter- 
preted with regard to the completeness relation. 

ib) The Group of Rotations in 3- dimensional Space 

We consider the functions / = f(P) on the unit sphere as 
the vectors of an oo- dimensional unitary space whose metric 

is given by ^\f(P)\ 2 d<o ; du> is the surface element of the sphere 

over which the integration is to be extended. If the point P 
goes over into P f = sP under the rotation s, the function f 
goes over into the function /' defined by f'(P') = /(P). The 
surface harmonics Y x of degree l [cf. II, § 4] obviously span a 
(2/ -f 1) -dimensional sub-space which is invariant under the 
totality of transitions /->/' induced in function space by the 
various elements 5 of the rotation group b = b 3 — here again we 
speak of this representation as They are consequently the 
substratum of a certain representation of b which is induced 
in 9 ti by b. On choosing a definite direction as that of the 
s-axis we may, as in II, § 4, take the set 

= -t) 

as a basis for the surface harmonics of degree l. We then have 
a unitary representation, and the sub-spaces corresponding 
to the various values 0, 1, 2, • • * oil are mutually perpendicular 
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in the unitary sense (orthogonality properties of surface har- 
monics). b contains the 2-dimensional rotation group b 2 — e.g. 
as the sub-group of rotations about the s-axis. The structure of 
7 Z (OT) shows that on restricting b 3 to this sub-group b 2 the 
representation is reduced into the 1-dimensional representa- 
tions for which m = l, l — 1, • • •, — The fact that 
any function on the unit sphere possesses a unique expansion 
in terms of surface harmonics means that on reducing $ into 
its irreducible components each of the representations l = 0, 
1, 2, * • *, occurs exactly once. This reveals the true signifi- 
cance of surface harmonics ; they are characterized by the 
fundamental symmetry properties here developed, and the 
solution pf the potential equation in polar co-ordinates is merely 
an accidental approach to their theory. 

Rotations are orthogonal transformations of three variables 
x, y, z. If we wish to include with the proper rotations with 
determinant + 1 also the improper ones with determinant — 1 
— “ augmented rotation group b’ ” — this can be done by intro- 
ducing the reflection 

i:x'=-x, y'=—y, z' = - z (8.7) 

in the origin. Its reiteration ii is the identity, and it commutes 
with all rotations. The matrix corresponding to it in the 
representation defined by the surface harmonics of degree l is 
the (21 + 1) -dimensional matrix (— • 1)*, for the surface harmonics 
of degree l are homogeneous polynomials of degree l in x , y, z. 
We can thus obtain two representations 3)j + , ©f of the aug- 
mented rotation group from the representation ©j of proper 
rotations ; these two coincide with ©j for proper rotations, 
but in the first the matrix associated with the reflection i is + 1 
whereas in the second it is — 1. We call this db 1 the signature 
of the representation. Hence in the oo -dimensional repre- 
sentation ® of the augmented group b’ each ©j occurs once 
with signature (— l) 1 , but not with the opposite signature. 
Although we are not as yet in a position to prove it, the 
©, (1=0, 1, 2, • * •) constitute a complete system of in- 
equivalent irreducible (single-valued) representations of the 
rotation group b, and the © z + , ffif together constitute such a 
system for the augmented rotation group b\ 

Now consider the unitary function space of all functions 
f(P) in 3-dimensional space for which the integral |/| 2 over all 
space is finite. Let the representation induced in this space 
by rotations s, in which the transition from / to the transformed 
function f = sf is associated with s, be denoted by Each 
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function f(P) can be expanded in a series of terms of the form 
<f>(r) * Y v Choose a complete ortffogonal system &(r), <£ 2 (r), • • • 
in the domain of functions <f>(r) of the radius r, in the sense of 
the equations 

00 

J r*$ m {r)<t> n {r)dr = S mn . 

0 

The functions of the form <fi n (r) • Y x then constitute a {21 + 1 )- 
dimensional sub-space 9l n r which is invariant under rotations 
and in which 6 induces the representation ©* Different 3ft nl 
are mutually unitary-orthogonal. Each then appears in 
6 infinitely often, its various occurrences being distinguished by 
the 11 radial quantum number ” n. Consider the analysis of 
single electron spectra given in Chap. II, § 5 , in the light of these 
mathematical developments. We then see that the azimuthal 
quantum number l is of purely group-theoretic significance, 
whereas the radial quantum number n refers to the dynamical 
situation, for the manner in which the orthogonal system <f> n (r) 
is to be chosen is determined by the dynamical differential 
equation. 

The proper rotations of 3 - dimensional Euclidean space about 
the origin of Cartesian co-ordinates x , y , z, i.e. the real orthog- 
onal transformations with determinant + 1, are most easily 
represented by a stereographic projection of the unit sphere 
about the origin on to the equatorial plane z = 0, the south pole 
of the sphere being the centre of projection. If the point 
{x\ y\ 0) be the image on the plane of the point [x, y ) z) on the 
sphere and we write £ = x r + iy\ the formulae for the projection 
are 

, • . 2 1 1 ~£ l 

x+iy nnr * iy ~TTt?’ 

But it is preferable to introduce the two homogeneous complex 
co-ordinates £, 17 in place of £ by means of the equation £ = 17 /£ ; 
the south pqle £ : 17 = 0 : 1 is then included. We then have 

x + iy:x — iy: z : 1 = 

: 2£i 7 : — 1777 : -f 77 fj. 

Accordingly each unitary transformation 

a • £' = <*£ + pr /, 77' = + 877 

of the co-ordinates f, 17 corresponds to a rotation s of the sphere, 
the points of which are represented by the rays f : 17 of 2- dimen- 
sional unitary space. Since, as is readily seen, any point and. 
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tangential direction through it on the sphere can be carried 
over into any other such configuration on the sphere by means 
of such rotations, we obtain in this way all rotations. Since 
we are only concerned with the ratios of the coefficients a, j8, 
y, 8, the arbitrary factor of proportionality may be chosen in 
such a way that the determinant of the transformation is 1. 
Nevertheless this normalization is somewhat artificial as the 
correspondence is still double-valued, for on multiplying the 
coefficients of the unitary transformation by — 1, i.e. on going 
over from a to — o*, the normalization is unaffected. Hence to 
each element cr, (7.4), of the unimodular unitary group u corre- 
sponds a rotation s : a s under which the co-ordinates 
x + iy } x — iy ) z transform like 

m m, (8.8) 

or 

% + y ~ — d)t z ~ — vv- ( 8 * 9 ) 

(The symbol which we occasionally employ, means that the 
expression on the left transforms like the one on the right.) 
We obtain in this way all rotations, each one exactly twice. 
The rotations about the 2 -axis are obtained from the ‘‘ principal 
transformations ” 

f = V = \v 

of u. In fact, on setting s = e iu> = e(<o) the angle of rotation 
about the 2 -axis is ^ = — 2o>. In virtue of the correspondence 
a s the rotations in 3-dimensions constitute a representation 
of the group u ; and, conversely, the association s -> o is a 
representation of the group b .= b 3 of 3-dimensional rotations 
by u, although this representation is double-valued. In virtue 
of this correspondence s or any representation U(a) of U yields 
a representation of b 3 (“ T process,” § 5) ; may thus be thought 
of as a representation of b 3 , in which case we write it where 

j — The (“even”) with integral j are single-valued, 

those with half-integral (i.e. half an odd integer) j are double- 
valued. On restricting the group b 3 to the sub-group b 2 of 
rotations about the 2 -axis is reduced into the 2 j + 1 one- 
dimensional representations (m = /, j -— 1, • * *, — j). To 
show this we first note that the substratum of our representation 
$)* consists of the monomials (7.2) 

x{m) = ~vrtn ( I ’ + fe==2 ^> i — k=2m), 
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where m runs through the values j y j — 1, • • •, — j. The 
transformation induced on these variables by a rotation </> 
about the £-axis is accordingly 

x{m) e(— m(f>) * x(m). 

The representation a -> s of it is itself contained among the 
representations % of u constructed above ; it is, in fact, 

To show this we note that if (£, 77), (£', 77') be subjected to the 
same transformation a of u, then the determinant £7/ — 77^', as 
well as £1 + 7777, is invariant. Consequently (£, 77) transform co- 
grediently to (77', — £'), or as (77, — £) ; hence 

x + iy ~ 77 2 , # — iy 0 ^ £17. (8.10) 

The representations with integral j are identical with those 
obtained above as the representations induced on surface har- 
monics of order j , for each polynomial in x, y, z of degree j is, 
in virtue of (8.10), equivalent to a form of order 2 j in £, 77. 

If we wish to augment it = u 2 in a manner paralleling the 
augmentation of b = b 3 by the improper rotation i (reflection 
in the origin) we must consider it as an abstract group rather 
than a group of linear transformations in two variables. Denote 
the element corresponding to i by 1 and the elements of the 
original it by a as before. We define the augmented it* as the 
totality of elements of the types a and ia ; 1 must naturally 
obey the multiplication laws 

ia — ox, u = 1. 

and 6“ are then those representations of it’ which coincide 
with for elements of the restricted group it and which as- 
sociate with the element 1 the unit matrix + 1 and its negative 

1, respectively. The sign + is again called the signature. 
The representation ©£“ associates the augmented rotation group 
b* 3 with it’. 


(c) The Lorentz Group 

Let the 3-dimensional Euclidean space be referred to homo- 
geneous projective co-ordinates (a = 0, 1, 2, 3) defined by 



The equation of the unit sphere is then 

'"‘‘#0 + *i + #2 + tf 3 =0 


( 8 . 11 ) 
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and the formulae for the stereographic projection considered 
above become 

x 0 = ll + yy, x \ — h 4- yi 

*2 = — *£)> *3 = II ~ VV 

On subjecting £,* rj to an arbitrary linear transformation a the 
x# undergo a corresponding real linear transformation s which 
leaves the equation (8.11) invariant. If the absolute value of 
the determinant of cr is 1, we can readily show that the form 

— #o “ 1 “ #i + x 2 + #3 (8.13) 

is itself invariant under the corresponding s, and that the deter- 
minant of s is -f L 

We now consider x 0 = ct, x Xi x 2 , x 3 as the co-ordinates of 
space-time; (8.11) is then the equation of the light-cone, the 
generators of which are the possible paths for a beam of light. 
In the restricted thebry of relativity normal co-ordinate systems 
for space-time are connected with each other by arbitrary 
Lorentz transformations , i.e. by any real linear transformation 
which leaves the form (8.13) invariant and which does not 
interchange past and future. Lorentz transformations con- 
stitute a group, the 44 complete Lorentz group,” and this group 
describes the homogeneity of the 4-dimensional world. This 
group consists of 44 positive ” and 44 negative ” transformations, 
i.e. transformations with determinants + 1 and — 1, respectively. 
The first constitute the 41 restricted Lorentz group,” from which 
the complete group is obtained by introducing in addition the 
spatial reflection 

x 0 -* # 0 , -> — (a = 1, 2, 3). (8.14) 

Under the restricted group right and left, as well as past and 
future, are fundamentally different. Since the expression for 
x 0 in (8.12) is positive definite, we may state the result obtained 
above in the form : any linear transformation of f , rj } with deter- 
minant of absolute value 1 , induces a positive Lorentz transforma- 
tion s in the x u . Transformations o which differ only by a factor 
e ix of absolute value 1 give rise to the same s. The correspondence 
<7 — s is naturally a representation. 

The question of whether every positive Lorentz transformation 
5 can be obtained in this way arises immediately. That this 
is in fact the case can be seen from general continuity con- 
siderations, for the positive Lorentz transformations constitute 
a single connected continuum. But it is also easily proved by 
elementary methods. Since we have seen in ( b ) above that the 
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rotations of space s are obtained from the unitary transforma- 
tions cr, we need only to examine the Lorentz transformation 

(*o + *3) -*• a*{x 0 + x 3 ), {x 0 — x 3 ) -> — x 3 ), 

CL 

affecting the time axis, where a is a real non-vanishing constant. 
But this transformation is obtained from the unimodular a : 

£-*■«£» 17-^17. 

Returning to the general case, the correspondence s-*-o is a 
2-dimensional representation of the restricted Lorentz group. 
But a is determined by S' only to within the arbitrary “ gauge 
factor ” e a ; we may therefore normalize it by the condition 
that the determinant of a shall itself be unity, not merely its 
absolute value. Even so, a remains double-valued, for — a 
satisfies the normalizing condition as well as a. This repre- 
sentation s~>a contains the representation of the rotation 
group considered in ( b ) on allowing 5 to run through the sub- 
group of spatial rotations contained in the restricted Lorentz 
group. 

The expressions (8.12) are Hermitian forms with matrices 


l 0 

0 

S t -•« 

1 

) 

0 — i 

s 3 = . 

1 

, s 3 = A 

0 

0 1 ’ 

1 

o| 

i 0 

0 

-1 


Hence if jr denotes the one-columned matrix with elements f, 17 
equations (8.12) may be written 

x„ == ( 8 . 16 ) 

On replacing g, 17 by ij, — £ the x a undergo the spatial re- 
flection (8.14). That is one way of including the negative 
Lorentz transformations. But if we require that the corre- 
sponding transformation of g, t] be linear, we must introduce in 
addition to j ■■■■• (£, 17) a second pair j' = (£', rj') which undergoes 
the transformation c' contragredient to <J. Then 

(17, ■■ ■ £)~(g', rj') to within the factor d, 

( rj , — £)'-*'(£', rj 1 ) to within the factor d, 
when; 1/ is the determinant of a. Defining 

.% s u , s:~-s a (« = 1 , 2 , 3 ), 

the quantities 
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undergo the same transformation 5 as (8.16), provided the 
absolute value of the determinant of a is 1. The same is true 
for any linear combination of the two, e.g. Hence the 

quantities 

= + (8.17) 

undergo the given positive Lorentz transformation s when rj 
are subjected to a certain transformation a and simultaneously 
£', 7 j' to the transformation ex' contragredient to a. Furthermore , 
they undergo the transformation (8.14) on interchanging the two 
pairs £, £', i.e. on subjecting the four variables to the trans- 
formation 

ij-m'; £'-*£, (8.18) 

The expression 

W + m 

is invariant in virtue of the transformation law of £', rj' defined 
above. To obtain an expression which is also invariant under 
the interchange (8.18) we must add to the above the expression 
obtained from it by this interchange : 

(If + vv') + (I'l + vv)- (8.19) 

It will be found advantageous to denote the column con- 
sisting of the four elements (£, rj ; rj') by a single letter £. 
Let that linear transformation of these four variables which 
transforms £, rj in accordance with S* and r[ in accordance 
with S* be denoted simply by S a : (8-17) then becomes 

We must now ask to what extent the linear transformation a 
of the four variables £ is determined by the requirement that 
it induce a given (positive or negative) Lorentz transformation 
5 of the Hermitian forms x*. It suffices for this purpose to 
inquire what transformations of the £ induce the identity on 
the variables x a . The only transformations of this latter kind 
are those which multiply f , 17 with a common factor e a of absolute 
value 1 and at the same time vf with any factor e ix ' (inde- 
pendent of the first) of absolute value 1. But a can be more 
precisely specified by the requirement that (8.19), i.e. £T£, be 
also invariant. The two arbitrary “ gauge factors ” e a , e a ' 
must then coincide : the substitution cr is then determined to 
within a factor e a . 

Our analysis reduces the problem of the representations of 
the Lorentz group to the corresponding problem for the uni- 
modular linear group c 2 . 
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§ 9. Character of a Representation 

The trace of a linear correspondence A , i.e. the sum of the 
elements in the principal diagonal of the matrix A , is an in- 
variant under transformations of co-ordinates which is of 
particular importance. The trace x( s ) °f the correspondence 
U(s) associated with the element 5 of the group g in a repre- 
sentation § of g is called the group characteristic , or, in 
order to avoid assigning yet another meaning to this second 
word, which has already appeared in another important con- 
nection in quantum mechanics, simply the character of the 
representation fp. Equivalent representations have the same 
character ; the name is so chosen because the converse of this 
theorem is true within wide limits. Since £7(1) = 1, the value 
of the character x(l) for the unit element is equal to the dimen- 
sionality of the representation. 

It follows from the equations 

U{asa-') = U{a)U{s)U(a- 1 ) = U{a)U(s)U~'{a) 

that the matrices U(s) and U(asa~ l ) differ only in their orienta- 
tion and consequently have the same trace : 

Xiasa- 1 ) = X {s). 

Now s and osar 1 are any two conjugate elements of the group g, 
i.e. they belong to the same class of conjugates in the sense of 
§ 3. We speak of a function f(s) on the group manifold which 
has the same value for all elements s belonging to the same 
class as a class function ; such a function can at most allow us 
to distinguish between different classes, but not between ele- 
ments of the same class. The distinguishing feature of class 
functions can also be expressed in the equation 

m =/(*)• 

The validity of this equation for / = x follows from 
U(st) = U(s)U(t), U(ts) = U(t)U(s) 

and the fact that the trace of the matrix AB is equal to the 
trace of BA. 

The character x{s) of a unitary representation : U(s~ 1 )=0*(s), 
satisfies the equation 

X(s~') = (9.1) 

We shall say that the characters of irreducible representations 
are primitive. Any unitary representation § can be reduced 
into its irreducible components, and the normal co-ordinate 
system in the corresponding sub-spaces can be so chosen that 
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two irreducible constituents are equal if they are equivalent. 
If in this sense 

§ = ml) -f- m’tf + • • (9.2) 

where I), 1)', • • • are inequivalent irreducible representations and 
w, m' * • • are the numbers of times they occur in then the 
character X of § is expressed in terms of the characters x, X> * * ’ 
of f), f)', * • * by the equation 

X(s) = m x {s) + m'x'{s) + • • -. (9.3) 

From an n-dimensional representation ^:s~^U(s) i with 
the character xi s )i an d an ^'-dimensional U'(s) of 

character xi s ) we can construct the (wn')-dimensional repre- 
sentation § X The elements in the principal diagonal of 
U(s) X U'(s) are obtained by multiplying all elements in the 
principal diagonal of U(s) by those in the principal diagonal 
of U f (s) : the character X §' is consequently x{$) x'i s )• Again, 
if § is a representation of the group g, $' a representation of 
the group g', then the representation Jg X $?' of g X g' has the 
character £ defined by 

Z(s,s')=x(s) x '(s'), (9.4) 

where s runs through the elements of g and s' those of g'. 

We need not distinguish between, a 1-dimensional repre- 
sentation and its character ; the character satisfies the simple 
equation (4.2). This holds, for example, for the characters 
e(m<f>), eq. (8.2), of the rotation group b 2 . 

By the theorem on the transformation of unitary correspond- 
ences to principal axes, each element of the group u = u 2 is 
conjugate to a principal element, i.e. an element of the form 

s 0 

0 1. W = 1 (»•«) 

e 

The characteristic values e, 1/e are determined to within the 
order in which they appear. Introducing the angle to by the 
equation e = e(oi), to characterizes a class of conjugate elements 
of U ; we are only concerned with co mod. 27 t, and furthermore 
the class — at coincides with the class to. Since for any re- 
presentation © of u the character x (s) depends only on the class 
of the element s, it suffices to calculate it for elements of the 
form (9.5). It must be a periodic function of the angle co with 
period 2 n, and it must furthermore be an even function of to ; 
its value for ©/• is 

1 e /+i _ e -(/‘- 1) 

X/ = £f + ef ~* + • • • + s ! — — _ _ _-i ■ 


(9.6) 
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The characters of the representations considered in the 
other examples of the preceding section are just as readily 
calculated. 

§ 10. Schur’s Lemma and Burnside’s Theorem 

Lemma (10.1). 5 Assumption . Let 27 be an irreducible system 
of linear correspondences of an m-dimensional vector space x 
on to itself, and £2 such a system of an n-dimensional vector 
space 3. A linear correspondence A shall satisfy the equation 

EA = AQ (10.2) 

in the following double sense : for each U of E there shall exist 
a V of £2 such that 

UA = AV, (10.3) 

and conversely for each V of £2 there shall exist a U of 27 such 
that this relation is fulfilled. 

Assertion. Either A = 0 or m = n and det A =t= 0 ; in the 
latter case 27 and £2 are equivalent. 

Proof. We first make use of the assumption that 27 is 
irreducible in connection with equation (10.2) in the first sense. 
Considering the k ih column 

a lh) a 2 k) ■ * ’> a mk 

of A as a vector a^ k \ equation (10.3) asserts that the vector 
U&W associated with through the correspondence U is 
a linear combination of the vectors specifically that 

UaM = 2v»i&), F=|MI- 

h 

Consequently the sub-space of t spanned by the n vectors aW 
is invariant under 27, But because of the assumption th^t E 
is irreducible either = 0, A = 0, or the span the entire 
space x, in which case m of them are linearly independent ; 
this latter is possible only if n ^ m. That our conclusion 
contains two possibilities is due to the fact that the concept 
of irreducibility contains such an alternative. 

The second part of the assumption can be given a simple 
geometrical interpretation on going over to the transposed 
matrices : £2* is irreducible and for each V * of £2* there exists 
a U* of 27* such that 

V*A* = A*U*> 

The reasoning employed in the first part of the theorem allows 
us to conclude : either A* = 0 or m ^ n. We summarize the 
results thus far obtained in the statement : Either A = 0 or 
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m = n; in the latter case the m = n columns of A are 
linearly independent, i.e. the determinant of A does not vanish. 
But then U and V are determined uniquely by the relation 
(10.3) and E and Q are equivalent. 

In formulating these results it is desirable to consider the 
case of equivalence separately : 

I. If the two irreducible systems E, Q are inequivalent, (10.2) 
can only be satisfied by A — 0. 

II. If E is an irreducible system a correspondence A commutes 
with all correspondences U of the system E : 

UA = AU (10.4) 

if and only if A is a multiple of the unit matrix 1. 

Assertion II follows from the lemma proved above by 
elementary methods and the fundamental theorem of algebra. 
For by the latter there exists a number a such that 
&tt(A — al) = 0, and since A = A — al satisfies (10-4) for 
all U if A does, we conclude that since det A — 0 we must 
have A = 0. 

Applied to representations, our results are : 

Fundamental Theorem (10.5). I. If U(s), s -> V(s) are 
two inequivalent irreducible representations of a group q, the 
equation 

U(s)A = AV[s) 

can be satisfied by no matrix A which is independent of s , except 
A = 0. 

II. A matrix A which is independent of s and which satisfies 
the equation 

U(s)A = AU(s) 

for all s is necessarily a multiple of the unit matrix 1 . 

If there exists a matrix A which satisfies U(s)A = AU(s) 
identically in 5 and which is not merely a multiple of the unit 
matrix 1 , the argument employed above supplies us with a 
constructive process for the reduction of the representation 
s -> U(s) with the aid of A. 

We now consider an application of these important results, 
which are fundamental for the entire theory of representations, 
in order to prove a theorem due to Burnside . Let E be a 
multiplicative system, i.e. if U, U ' are two correspondences in 
E then the product UU r is also a correspondence in E . This 
concept is somewhat wider than that of a group ; we need not 
require that U possess an inverse — its determinant may be 0. 

Burnside's Theorem (10. 6). 8 In an irreducible multiplicative 
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system E of linear correspondences U = \\u ik \\ of an n dimensional 
vector space on to itself the components u ik are linearly independent. 
This asserts that the only matrix L which satisfies the equation 

tr (UL) = Eh a u i k — 0 


for all matrices U of the system is L = 0. Contrary to the 
assertion, we assume there exist non-vanishing matrices satis- 
fying this equation; such matrices we shall call L-matnces. 
It is of course possible that every L- matrix whose first column 

hit ^21l *1 ^"1 

vanishes must itself vanish. But in any case we can find a 
definite column index h with the following properties : there 
exist non- vanishing /.-matrices whose first h 1. columns 
vanish and are such that if the h th column also vanishes then 
necessarily L = 0. We shall call L-matrices whose first A — 1 
columns vanish special L-matrices. They constitute a linear 
f amil y 0 f m ^ n dimensions ; we denote a basis for this family 

by 

Lri), L< 8 >, • • •, L<»>. 


The hP column of a special L-matrix will be written I. 
Since 2 is multiplicative the equation 


tr (U'UL) = 0 


is satisfied by each L-matrix, where U, U' are arbitrary corre- 
spondences of the system E. With L, UL is also an L-matrix ; 
obviously it is a special L-matrix if L is. Each of the matrices 

UL* 1 ), C/L«, • • •, ULW 

is therefore a linear combination of Lri), • • -, and each of 
the vectors Ul* 1 ), • • •, is a linear combination of the 

vectors fl 1 ), • • -, b m >. Accordingly the vectors !<*), • • -, I< m ) 
span a non-vanishing sub-space which is invariant under all the 
correspondences U, and in consequence of the irreducibility 
assumed above it follows that m — n and the vectors P 1 ), • ■ -, 
I<») span the entire w-dimensional space. The basis L< 1 ), • * •, L (fi ) 
of the family of special L-matrices can be chosen in such a way 
that Iri), • • l(») are the fundamental vectors of the space ; 

1 (1 > is then the column (1, 0, 0, • • •, 0), etc. Since then 

W = « lf 1W + • • • + M nr l(»> (10.7) 

we must also have 


ULO = « lr L(‘) -f • • • -f u nr Lf”l. 


( 10 . 8 ) 
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We now consider an arbitrary column, say the k & , of L. 
(This is of course of no interest if k < h, for the first h — 1 
columns vanish.) Suppressing the second index k, we now let 
^ = (^ii ‘ ‘ "> In) denote the & th column of L. Then in accordance 
with (10.8), equation (10.7) holds for the present 1, i.e. the k th 
instead of the h th column of L. Introducing for the moment the 
matrix 

1 4 " ■ • • <«|| 


ie> • • • «i 

consisting of the ft lh columns of L^\ • • *, U- n \ we may write 
(10.7) as the matrix equation 


UA = AU. 


But it follows from this that A must be a multiple of the unit 
matrix, i.e. 


4 r) = A • SJ, 



(r = i ) . 
(r 4= i) ’ 


or, returning to the original notation by adding the column 
index k ) 

&> = K-K 


Here we have, by the foregoing, A x = • • • = A^_x = 0, A* = 1. 
The equation 

tr (J7L<0) = 0 

becomes 

S u kr X k =0, (r = 1, • • •, »), (10.9) 

h ** 1 

i.e. all correspondences of the system Z* carry the vector A 
with components (X lf A 2 , * * *, A n ) over into the null-vector. 
In consequence of the irreducibility of S this vector must there- 
fore vanish, which is in contradiction with the equation A* = 1 ; 
Burnside’s theorem then follows by reductio ad absurdum. — If 
we know that the unit matrix is contained in the system 27, as 
is the case for a representation, we can conclude that A t - — 0 by 
taking U in (10,9) as the unit matrix. 

Reducibility requires that on employing an appropriate 
co-ordinate system all matrices U of the system 27 have an 
entire rectangle of vanishing elements and consequently implies 
a system of homogeneous linear relations between the components 
Uiic of a very special kind. Burnside’s theorem states that if 
there exists no system of homogeneous linear relations of this 
special kind, then there exists no linear dependence at all. The 
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real reason for this remarkable fact is of course to be found in 
the assumption that E is closed with respect to multiplication. 

If our system E consists of an irreducible representation 
which associates with the elements s of the group g the matrix 
U(s), we see from Burnside’s theorem that the components of 
U(s) are linearly independent. The method developed above 
can readily be extended to prove the same for the components 
of two or more inequivalent irreducible representations U(s), 
U'(s), ■ • \ 7 From this it follows that in particular there can 
exist no linear dependences between their characters x(s), x'(s), • • •. 
Any unitary representation § can be reduced into irreducible 
components ; the character of § is expressed in terms of the 
characters of these irreducible representations by (9.3). Since 
x(s), x( s ) are linearly independent the coefficients m, m', • • 
which give the number of times the irreducible representations 
f), appear in §, are uniquely determined. This con- 

stitutes a new indirect proof of the following result, which has 
already been proved in § 6 in a more general and more elementary 
way : The irreducible representations into which can be reduced , 
as well as the number of times they occur, are uniquely determined 
by no distinction being made between equivalent representations. 
Two unitary representations £)j and ip 2 are obviously equivalent 
if every irreducible representation which is contained in the one 
is contained in the other the same number of times. Hence 
if and are inequivalent the character of cannot be the 
same as the character of § 2 because of the linear independence 
of the primitive characters : a unitary representation is uniquely 
determined by its character alone , and its character may be used 
as a unique name for the representation itself. We here go no 
further into these extensions of Burnside’s theorem, which are 
due to Frobenius and I. Schur, as we shall obtain the same results 
by a more profound method in the next section under assump- 
tions which are more restrictive but which are sufficient for 
our purposes. 

We mention only one consequence. Jp, being representa- 
tions of the groups g, g', respectively, then | X is an irreducible 
representation of g x g'. Indeed, there can exist no homo- 
geneous linear relation with constant coefficients c ik , , K between 
the components Uijf^u^s') of U(s) x U'(s') except the trivial 
one c = 0. For on applying Burnside’s theorem for the 
irreducible system § we have 

E € ik ) « U^S ) — 0 , 

t, K 

and on applying it again for we must have c ik , tK = 0. 
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§ 11. Orthogonality Properties of Group Characters 

If the abstract group g is finite , then any representation 
U(s) is equivalent to a unitary one. To show this take 
any positive definite Hermitian form, e.g. the unit form, subject 
it to all transformations U(s) of § and sum over s, We thus 
obtain a positive definite Hermitian form H which is invariant 
under each of the transformations U(s). Now choose the co- 
ordinate system in such a way that H becomes the unit form ; 
then U(s), expressed in terms of these co-ordinates, is unitary. 
This same method of summation over the elements of the group 
gives rise to the fundamental orthogonality relations. 

Let !q:s-+U(s), $q':s-+U'(s) be two inequivalent irre- 
ducible representations of the finite group g, the former being 
g-dimensional and the latter ^'-dimensional. We write 

tf(s) = ||* tt (*)|| l U'(s) = \\u[ K (s)\\, 

U'y) = |K.(f)||. 

For a unitary representation §' 

KM = KM- 

If A is an arbitrary matrix with g rows and g' columns then 
obviously the sum 

2JU(t)AU'~ l (t) = B, (11.1) 

t 

taken over all elements t of g, is invariant in the sense that 

U(s)BU'~ l (s) = B. (11.2) 

In fact, the left-hand side of (11.2) becomes, in virtue of the 
fact that 5 U (s) is a representation of g, 

£U{t)AU'~ 1 {t), 

T 

where r = st, s being fixed and t running through all elements 
of the group. We therefore obtain equation (11.2) or 
U(s)B = BU'(s). 

In accordance with the fundamental theorem (10.5) it follows 
from this that B = 0, i.e. 

Z S u ik [t)ak*uUt) = 0. 

t k,K 

Writing 5 in place of t and remembering that the a ** are arbitrary 
numbers, we obtain the g 2 * g' 2 equations 

Zu ik [s)KM = 0, 

* 

or, in dealing with unitary representations, 

ZuiMKM — o. 


(11.3) 
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Taking the single irreducible representation s- -> U(s) in- 
stead of the two inequivalent representations §', we find by 
the same argument that the square matrix 

U(s)AU~ 1 (s) = B, 

found from an arbitrary square matrix A, must satisfy the 

U(s)B = BU(s). 

This requires, however, that B be a multiple of the unit matrix 1, 
i.e. 

S S w„(i) = a • 8*. 

I k, K 

the number a depends on the matrix A ) the dependence being 
of course linear and homogeneous. Taking as A that matrix 
which has as its only non- vanishing element a kK = 1, we obtain 
the equation 

j£Ui k {s}ii K J^s) = <x. Kk 3 t * t . ( 11 . 4 ) 

Now \\& iK (s)\\ * s the matrix reciprocal to ||tt uc (.?)|| : 

t 

On taking i = i in (11.4) and summing over i == 1, 2, • • g 
we find that 

A * Kb = g*Kk, 

where h is the order of the group g. 

Expressing the sum Z in terms of the mean value 3B = \ JJ 
our results may be written in the form 

SR{w«(*KW} = ji for ** = *» k = * (n.8) 

lo otherwise 

for any irreducible unitary representation § : s U(s) and 

= 0 ( 11 . 6 ) 

for any two inequivalent irreducible unitary representations 
$-> U(s), ,$*->■ U'($). The components of one or more inequivalent 
irreducible unitary representations constitute a unitary-orthogonal 
set of functions on the group manifold . 

It follows from these fundamental orthogonality relations 
that the components u ik (s), u u (s), • • • are linearly independent . 
Since the number of linearly independent functions of an argu- 
ment s which assumes but h values cannot be greater than h 
we must have 


g % + g t% + • • • ^ h. 



ORTHOGONALITY OF GROUP CHARACTERS 159 

On the left-hand side of this equation occur the squares of the 
degrees of any inequivalent irreducible representation of g. 

We obtain the orthogonality properties of the characters 
by writing k = i, k = i in (11.5), (11.6) and summing over 
these indices : 

Any primitive character satisfies the equation 

«*{*(»)} = 1 , (11.7) 

and the characters x{ s )> x'i s ) °f an V iwo inequivalent irreducible 
representations satisfy 

W{x(sMs)} = 0. (11.7') 

The primitive characters of inequivalent representations constitute 
a normal orthogonal set of functions. They are consequently 
linearly independent, and from this follow all the consequences 
discussed in the previous section. In particular, a representation 
of g can be unambiguously described by its character, no dis- 
tinction being made between equivalent representations. The 
number of times m the irreducible x occurs in the representation 
X is, following (9.3), given by 

( 11 . 8 ) 

and we have 

9R(X(,j)X(s)} = m 2 + m' 2 + • • \ 

This last equation offers a simple criterion for the irreducibility 
of a given representation in terms of its character x *• it is neces- 
sary and sufficient that the mean value of xx = |x| 2 — which is in 
any case integral — be unity. 

Since the characters are class functions we are in dealing 
with them concerned with an argument which runs through 
the K different classes of g ; there can therefore be no more 
than K linearly independent class functions. Hence a finite 
group can have no more inequivalent irreducible representations 
than classes. 

Whereas the general concept of a representation seemed at 
first to open up limitless possibilities, we now see that all 
representations are constructed from primitive ones and that 
the number of possible primitive representations is confined 
within narrow limits. The further content of the general theory 
of representations can be stated in the theorem that the sets of 
functions , the orthogonality of which we have shown above , are 
complete orthogonal systems. The primitive characters con- 
stitute a complete orthogonal system in the domain of class 
functions, i.e. there exist exactly K inequivalent irreducible 
representations. The components of a complete system of K 
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inequivalent irreducible representations constitute a complete 
orthogonal system for the totality of functions defined on the 
group manifold, or 

h — g 2 + g'* + • * •> 

where the sum on the right is extended over such a complete 
system and g, g', • • • are the dimensionalities of the individual 
irreducible representations. 

§ 12. Extension to Closed Continuous Groups 

The theory developed in the preceding sections cannot be 
extended to arbitrary groups, but it is applicable mutatis 
mutandis to a group whose elements constitute a continuous 
closed manifold of a finite number of dimensions. Just as the 
immediate neighbourhood of a point on a surface constitutes 
a plane, so the immediate neighbourhood of a point p 0 on an 
r-dimensional continuous manifold constitutes an r-dimensional 
linear manifold and the line elements from p 0 to neighbouring 
points p define an r-dimensional linear vector space. We 
assume that the infinitesimal elements of our group g (i.e. those 
dements in the neighbourhood of the unit element I), or rather 
the infinitesimal vectors leading to them from I, constitute 
such an r-dime.nsional vector space, the “ tangential space ’’ 
to g at I. The concept of an infinitesimal rotation will be 
familiar to the reader from the kinematics of rigid bodies, as 
well as the fact that these infinitesimal rotations in 3-dimen- 
sional space constitute a 3-dimensional linear family — in n-dimen- 
sional space an [«(» — l)/2]-dimcnsional family. The multiplica- 
tion of two infinitesimal elements of the group is then expressed 
by the addition of the corresponding vectorial line elements in 
the tangential space. 

A parallelepiped which will serve as a volume element in 
the neighbourhood of I is defined by r linearly independent 
line (dements, and its volume is given as usual by the absolute 
value of the determinant of the components of these r vectors. 
This volume element is, of course, not entirely independent of 
the choice of a co-ordinate system in the tangential space, but 
the transformation to a new co-ordinate system only multiplies 
the volumes of all such elemental volumes in the neighbourhood 
of I by a constant numerical factor. These volumes are there- 
fore determined to within the choice of a unit of measure ; more 
than this we ran hardly require. 

On extending the theory developed in the preceding section 
to continuous groups integration replaces su nmation, and it is 
therefore necessary to be able to measure volumes on the entire 
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group manifold of g. With the aid of the foregoing volume 
elements in the neighbourhood of f can be measured and com- 
pared immediately with each other, and the same is true for 
the volume elements at any other point of the group manifold. 
The only difficulty lies in carrying the unit of volume from the 
point I to any other point a. Examination of the argument 
of § 11 reveals that the measurement of volume must have the 
following invariantive properties : the volume of an arbitrary 
element must be unaltered by a left-translation of the group 
manifold which transforms the general element t into r = at . 
But this requirement just suffices to specify the process uniquely. 
Consider the volume element at a which arises from an elemental 
volume at I by the left-translation which throws I into a ; per 
definitionem the volumes of these two elements shall be the same. 
On carrying the volume element from a to b by means of the 
translation t! = (ba~~ l )t the equation t' = b(a~H) shows that with 
this definition of volume the volumes of the elements so obtained 
at a and b are equal. 

We further assume that our continuous group manifold is 
closed — in the sense, for example, that the surface of a sphere 
is a closed manifold in contrast with a Euclidean plane, which 
is open. This guarantees that we shall be able to integrate 
continuous functions of position on the group manifold over the 
entire manifold. We now choose the unit of volume in such a 
way that the volume of the entire manifold g is 1 ; the integrals 
are then mean values. We naturally require that the components 
of U(s) in a representation s ->■ U(s) are continuous functions 
of the element 5 of g. The laws (11.5), (11.6), (11.7), (11.7') 
and all consequences obtained from them in § 11 are then valid 
for irreducible representations of the continuous group g and their 
characters . 8 

The theory would be extraordinarily restricted if the measure 
of volume , which we have introduced in such a way that it is 
invariant under left- translations, were not automatically invariant 
under (1) right-handed translations : s s' = sa and (2) inversion : 
s -> s' = 5 " 1 . The first of these properties will be established 
by showing that the volume of a volume element at I is unchanged 
on taking it to a by a left-translation and returning it to I by a 
right-translation. Obviously each infinitesimal element 85 of 
the group then undergoes the linear transformation A : 

8s 8's == a * 8s • a'" 1 , 

i.e. the conjugation associated with the element a. Such 
linear transformations in the r-dimensional vector-space of the 



162 GROUPS AND THEIR REPRESENTATIONS 

infinitesimal elements of the group constitute a representation 
a-* A of the abstract group g. Since g is closed , each A must be 
11 absolute-unimodular” i.e. the determinant of A must have the 
absolute value 1 ; and this in turn allows us to conclude that 
the definition of transportation of volumes by either left- or 
right-translations leads to the same result. To prove this 
consider the element a and its powers a 2 , a z , • • \ Since the 
group manifold g is closed, the infinite set a , a 2 , a*, • • • on g 
possesses a point of condensation b , i.e. an infinite set of ex- 
ponents n can be found such that as n runs through this set 
a n converges to b . To the elements a n and b correspond the 
conjugations A n and 5, respectively, and in virtue of the con- 
tinuity assumed above det ( A n ) converges to det (J3) as n runs 
through the chosen set. Now since det (B) is a finite non- 
vanishing number, and since, if the absolute value of the deter- 
minant of A differed from 1, det ( A n ) would tend toward 0 or oo, 
we may conclude the truth of the above assertion. This also 
enables us to prove the truth of (2), invariance under inversion. 
For inversion sends the element 8s at I into — 8s, and this 
transformation is absolute-unimodular. Now send one of two 
inverse volume elements at I to a by a left-translation and 
the other to a" 1 by a right-translation ; we thus obtain volume 
elements at a and ar 1 which go into each other by the inversion 
s' ~ s~ x . Since both left- and right- translations conserve 
volumes, these two volume elements have the same volume. 


2 * /C 

J e{m<f>) e(m'<f>) d<f> = l 

n ' 


Examples of the Orthogonality Properties 

We have already found the primitive characters for the 
group of rotations b 2 of a circle into itself ; e(m<f>), m = 0, ±1, 
± 2, * • •, where <j> is the angle of rotation. They constitute, 
in fact, a unitary-orthogonal set of functions : 

[ 27 t (m = iri) 

0 + m Y 

If there existed further irreducible representations their char- 
acters would necessarily be orthogonal to all of these ; but this 
is impossible, for the functions e(m<f>) y where m takes on all 
integral values, already constitute a complete orthogonal 
system. We have, however, already shown by a more direct 
method (§ 8), which did not involve Parseval’s equation, that 
the system of primitive characters e(mj>) was complete. It is 
therefore natural to consider Parseval’s equation as the simplest 
case of the general group-theoretic completeness theorem men- 
tioned in § 11. 
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The character of the representation ©/ of the 2-dimensional 
unitary unimodular group u = u 2 is given by (9.6). Writing 


s = e(w), A = s — e"" 1 


2 i sin oj, 


— A A doj = da, 


we have 



ri ow) 

lo (/*g)‘ 


( 12 . 1 ) 


This leads us to suspect that da is the volume of that portion 
of the group manifold occupied by those elements a of the group 
whose angles of rotation lie between oj and oj + doj . [The 
total volume of the group manifold is then 

-t f A Kda> = 1.1 

2tt J 

If this is correct, (12.1) are the orthogonality relations predicted 
by the general theory, and the equation 

da = A A doj 


defines the density of the various classes of the group. In the 
last chapter we shall actually carry through the determination 
of volume and verify these results. 

If there were yet another irreducible representation, with 
character x > then £ = A • x would be an odd periodic function 
of oj with period 2tt which would be orthogonal to all the functions 
= A * Xf) he. to the functions 

sin co, sin 2a>, sin 3a;, • • \ 

But these latter are already a complete orthogonal set for 
the domain of odd periodic functions, and consequently the 
(£/ (/= 0, 1, 2, • * •) constitute a complete system of irreducible 
representations of the group it. A direct proof, which is inde- 
pendent of Parseval’s equation, is also to be found in Chap. V, 
§ 16 — indeed, it is there carried through for u n in an arbitrary 
number n of dimensions. 

The Clebsch-Gordan series 

XfXe = Xu* + Xf+t-t + • • • + X|/-»| (12.2) 

for the characters xr * s readily verified. If we know on general 
grounds that the character of a representation specifies it uniquely, 
this equation can be used as a proof of the reducibility of ©/ X 
into irreducible components with characters as on the right. 
Since the characters are much more readily handled than the 
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representations themselves this principle offers a very powerful 
method for obtaining assertions concerning representations. 
Let / ^ g and multiply equation (12.2), which is to be verified, 
by A : 

6x» = EL (» —f+g,f+ g — 2, • • •, f—g). 

V 

The product of 

— s -</ 4 !) with Xo — eg + e^" 2 + • • * + £~ g 
is the difference of two sums ; the one is 
ef'rg+i 0-1 -f- . . . -j- 

the exponent decreasing by 2 from term to term, and the other 
is obtained from this one by replacing all exponents by their 
negative. Hence the product is in fact 

V. {S *H1 _ ,-(«.M)} f t> =/+*,/+*- 2, •••,/ - f. 

The representations (£/, (£/" (/= 0, 1, 2, • • •) constitute a 
complete set of inequivalent irreducible representations of the 
augmented group ui. To establish this we first note that in an 
irreducible representation of u’ the matrix associated with the 
element t must be a multiple of the unit matrix, for it commutes 
with the irreducible system of matrices constituting the repre- 
sentation. Furthermore, u — I, so this matrix can only be 
-+- 1 or — 1. Since the matrix associated with t is a multiple 
of the unit matrix, and since the extension of u to u’ involves 
the addition of a single element t, the representation must remain 
irreducible on restricting the group u’ to the sub-group u. Hence 
every irreducible representation of ltg is obtained by supplement- 
ing the irreducible representations of u s by the association 

«. -*■ -t- 1 or i -*■ — 1. 

If Ip, ip' run independently through complete systems of 
inequivalent irreducible representations of the two (finite or 
closed continuous) groups g, g', respectively, then the ip X §' 
constitute a complete system of inequivalent irreducible rep- 
resentations for the direct product g X g\ To prove this we 
note that since the primitive characters x( s ) of g constitute a 
complete orthogonal system for class functions of the element s 
which runs through g and the primitive characters x'(s') of g' 
do the same for g', the totality of the products x(s) • *'(/) con- 
stitute a complete orthogonal system for the class functions of 
the element (s, s') which runs through the group g X g'. 

The representations E/, , introduced in § 5 constitute a com- 
plete system of irreducible representations of c, when /, g run 
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independently through the numbers 0, 1, 2, • • • ; we here 
only mention this fact without going further into it. 

§ 13. The Algebra of a Group 

We return for the present to finite groups. In order to be 
able to express the completeness theorem we associate with 
each function x(s) on the group manifold of the finite group g 
its “ Fourier coefficient matrix,” the group matrix . 

X = 2Jx{s)U(s), (13.1) 

8 

where U(s) is a representation of g. The trace of X ) 

£ = £x(s) x (s), (13.2) 

8 

is the Fourier coefficient of x(s) with respect to the character 
x(s) of ip. It is here desirable to consider the function x(s) as 
a single quantity x in the group domain ; each element s of the 
group is a dimension in u group space ” and the number x(s) 
is the ^-component of the quantity x. We may express the 
quantities themselves symbolically in the form 

x/= 2Jx(s) • s. (13.3) 

8 

The matrix X is associated with the quantity x in the repre- 
sentation § : x -> X in ip. Addition of “ group quantities ” and 
multiplication of them by a number are introduced in the usual 
way: x + y has the components x(s) + y(s) and ax the com- 
ponents a • x(s). Group quantities consequently behave like 
vectors in an A-dimensional space, where h is the order of the 
group. The following definition of multiplication of two arbitrary 
group quantities x and y is suggested by (13.3) : 

z = xy = Z x(t)y(t')tt f = Zb(s) * s 

t, (' 8 

where 

z{s) = £ x(t)y(t'). (13.4) 


This last equation, in which the sum is to be extended over all 
pairs of elements t } t f whose product is 5, defines the product z 
of the quantities x and y. We denote this product by xy and its 
components by xy(s) ; this is not to be confused with x(s) • y(s) } 
the ordinary product of the two numbers x(s), y(s ). Addition 
and multiplication of group quantities parallel addition and 
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multiplication of the group matrices associated with them by 
(13.1). Indeed, the product of 

X = Ex{s)U{s), Y — Ey{s)U{s) 

t f 

is given by 

2 = X7 = E U(tt') = £z(s)U(s), 

t,t' t 

where z(s) is defined by (13.4) 

The operations to which the group quantities may be sub- 
jected : (1) addition, (2) multiplication with a number, and (3) 
multiplication with one another, satisfy the usual laws of 
ordinary algebra with two important exceptions : multiplication 
is not commutative and division is not in general possible , i.e. the 
equation ax — b for given a 4= 0 and b may have no unique 
solution or even no solution at all. But there does exist a 
quantity 1 having the properties of unity : la = al = a for 
every quantity a ; its components all vanish with the exception 
of the one associated with 5 = 1, which is 1. A domain of 
quantities as described above is called an algebra ,• and the 
“group quantities ’’ are the elements of the algebra ; care must 
be taken not to confuse these with the elements of the group 
(cf. V, § 5). The association in the representation § 

satisfies the conditions : 

1. 1 -*■ 1, to the element 1 corresponds the unit matrix 1 ; 

2. if x X, y -*■ Y and a is a number, then 

x + y-*X+Y, ax -* aX, xy-^XY. 

A representation § of the group is the same as a realization or 
“ representation ” of the algebra of the group by matrices such 
that these conditions are satisfied. Actually all we have done 
here is this : we have gone over from the matrices U(s) associ- 
ated with the individual elements of the group to the linear 
manifold of matrices for which they constitute a basis. 

What characterizes an element a of the algebra whose com- 
ponents a(s) define a class function ? We have in general 

ax{s) = £a(st)x[r' 1 ), xa{s) — J£fl(fs)x(r l ), 

and a class function satisfies the equation 

a(5i) = a(ts). 

Hence such an a is characterized by the fact that it commutes 
with all elements x ‘of the algebra : ax = xa. Employing a 
term carried over from group theory to algebra we may say : 
those elements whose components depend only on the class of 
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conjugate group elements to which the argument s belongs constitute 
the central of the algebra. 

We are interested only in unitary representations s U(s). 
For such a representation the Hermitian conjugate of (13.1) is 

X = £x{s)U(s) = £x{s)U{s-i) = Zx{sr*)U(s). 

S $ s 

Hence on defining the conjugated of the element a by x(s)=x(s^ x ) ) 
Hermitian conjugate matrices are associated with conjugate 
elements in a unitary representation ; this characterizes unitary 
representations. An element will be said to be real if it coin- 
cides with its conjugate. We have seen that the character 
x(s) of a unitary representation satisfies this condition 
xl s ) = xis- 1 ). 

Let § be a g-dimensional irreducible unitary representation 
of g. C = \\c ik \\ being a given g-dimensionai matrix, the element 
c of the algebra defined by 

c{s) = Zc ik - g f ik {s) = \tT[C?J(s)] 

is such that c -» C in § ; this is readily verified with the aid of 
the orthogonality relations. Hence in the correspondence x X 
X runs through all g-dimensional matrices. We denote the 

quantity with components | u ik (s) by e ik . The set H of all 

elements of the form 

Z c ik e ik, 
i,* 

where the coefficients c ik are arbitrary, is naturally closed with 
respect to the operations of addition and multiplication by a 
number. But the product of two elements in H is again an 
element in H ; indeed, if c is in H and A" is an arbitrary element 
of the algebra both cx and xc are also in H. We express this 
situation in a terminology paralleling that of the theory of groups : 
H is an invariant sub-algebra of the algebra JT of all group quantities. 
To prove these assertions we first note that the definition (13.1), 
together with the condition that s-> U{s) be a representation 
yields the equation 

XU(s -*) - Zx{t)U(tsr x ), 

t 

or, on replacing U(s~~ l ) by 

xu(s) = zdisr^xit). 

t 


(13.5) 



168 GROUPS AND THEIR REPRESENTATIONS 

Multiplying on the left by C = \ |c<*| ! and constructing the trace 
we find 

| tr [(CX) U(s)] = t) = cx(s), 


whence y — cx is in H 




cx = E Vik 

t. k 

(13.6) 

and the matrix 


\\Vik\\ - cx. 

(13.7) 


In the same way we can show that if c belongs to H then xc 
does also. If 

*-> x= ||#«|| in§ 

we call 

£%ik e ik 

the component of x in H. In accordance with (13,6), (13.7) this 
component is the product of x with 

» = «11 + + • • • + ; 

it is sx = xb . s is a real element belonging to the central of the 

group algebra P, with components | • x(s) ; it is “ idempotent,” 

i.e. it satisfies the equation se = 8. In particular, the product 
of two elements 

a = Eaik e Oc, b — S^ik^ik 

of H with coefficient matrices A, B, is the quantity ab in H 
with the coefficient matrix AB. 8 is the 1, the “ modulus,” or 
“ principal unit,” of the sub-algebra H since ex = xs = x when 
x is in H. The algebra H is identical with the algebra of all 
^-dimensional matrices (“ simple matric algebra ”). The “ units ” 
e ik satisfy the equations 

e ir e rk = e ik , e ir e tk = 0 for r #= s. (13.8) 

The central of the sub-algebra H consists only of the multiples 
of its modulus 8. 

An irreducible representation : s -* U'(s) = | u'^s) || of 
dimensionality g' which is not equivalent to § yields another 
invariant sub-algebra H' consisting of all elements of the form 

iK.ll-c'- 


*1* 
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The components of e[ K are j^u[ K {s). It follows from the 

orthogonality relations existing between inequivalent repre- 
sentations that d 0 in the representation ip. If c is in H, 
then, by applying (13.6) for x = d , cd = y is also, but since 
then X = 0 (13.7) yields y = 0 ; the two sub-algebras are 
independent in the sense that the product of an element in one 
with an element in the other is always 0. Hence the “ units ” 
satisfy 

e ik d lK = 0. (13.9) 

The modulus 

«' - 

of H' satisfies ee' = e's = 0 in addition to e'e' = e'. 

If a(s) is a class function, a belongs to the central of r and 
if a~> A in the ^-dimensional irreducible representation § 
then the matrix A commutes with all matrices X. Hence A 

is a multiple of the unit matrix : A = - 1. By (13.2) we find 

& 

that the trace a of A is * 

a = Za(s) x {s). 

8 

In this way the entire theory of representations can be 
translated into the language of modern algebra. This leads to 
a greater freedom of operation and is preferable for the expression 
of the completeness theorem. The orthogonality relations 
between u^s), u , iK (s) > * • * lead to Bessel’s inequality 

g ■ tr {Xl) + •••§*• £x{s)x{s), (13.10) 

8 

where X in the sum on the left is the matrix (13.1) associated 
with x ($) in the g-dimensional irreducible representation § and 
the sum is taken over any set of inequivalent irreducible repre- 
sentations * • *. This inequality is obtained by expressing 
the fact that the mean value of z($) z(s) is non-negative (cf. I, § 7), 
where z is that element obtained from x on subtracting from x 
its components in H, * * * : 

Z = x — (2X* e ik + • • •) = x — (X8 + • • •)• 

Since the characters constitute an orthogonal system we also 
have the Bessel inequality 

# + • • • £h- £x{s)x(s) (13.11) 

8 

* Cf. also Appendix 2 at the end of the book. 
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where £ is defined by (13.2). The completeness theorem asserts 
that in both cases the equality sign holds when the sum is extended 
over a complete system of inequivalent irreducible representations , 
where in (13.10) x(s) is any function on the group manifold and 
in (13.11) any class function. The second relation is a special 

case of the first, since for class functions X = 1 1. 

If the abstract group g is a finite continuous group which 
is closed in the sense of § 12, instead of a finite group as above, 
the sums must be replaced by integrals ; the measure of volume 
on the group manifold is introduced as in § 12. We then have 
in place of (13.1), (13.4) : 

X = ^x(s)U(s)ds, 

xy{s) = ^x{st- x )y{t)dt — jjc(f )y(t~ l s)dt. 

The modulus 1 of the algebra must have as components the 
values of a function 1 ( 5 ) which vanishes everywhere on the 
group manifold except at the point s = 1 and must there be 

so large that Jl(s)ds = 1. Such a function does not exist, but 

we can construct functions approximating these conditions 
arbitrarily close. 

The completeness relations assert that any element x of 
the algebra of a finite group g is the sum of its components in 
the totality of sub-algebras associated with a complete system 
of inequivalent irreducible representations. The group algebra 
r is thus reduced to a set of independent simple matric algebras. 
It suffices to prove this theorem for x = 1 : 

l'= 6 + «' + ••• = (e u + • • • + e„) + * • (13.12) 

for on multiplying this by x it follows for all elements x. These 
assertions cannot be carried over to continuous groups in the 
form here stated ; we must hold to the formulation (13.10) 
(with = instead of s*) containing an arbitrary function x(s). 
We go into the proof of these results in Chap. V, where all 
the results of this section will be derived anew and discussed in 
detail from another more profound point of view. 

§ 14. Invariants and Covariants 

We first discuss briefly the classical concept of an invariant. 
Consider, for example, the group c = c 2 of homogeneous linear 
transformations of two variables £, 17 with unit determinant. 
Let 


a£ 2 -f- 2b£r) -)~ erf- 
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be an arbitrary quadratic form in the two variables. The 
“ discriminant ” ac — b 2 is an invariant, for the discriminants 
of two forms which are such that either goes into the other on 
transforming tj by some element of c have the same value. 
We may have, instead of one arbitrary quadratic form, one or 
more arbitrary forms /,<£,••• of given orders, n, v, • * *. An 
invariant is a rational integral function I of the coefficients of 
these forms which is homogeneous in the coefficients of each of 
the forms /, <f>, * • • and which has the same value on replacing 
these coefficients by the coefficients of the forms /', <f>', • • • into 
which /, • • * are transformed by an arbitrary transformation 

a of c affecting the variables £, rj. 

The coefficients # 0 , a h • • a n of an arbitrary form of order 
n in the variables £, rj undergo a certain linear transformation 
on subjecting the variables to a transformation o* of C, and the 
correspondence between a and this transformation constitutes 
a represe?itation of the group c. The same is true for the totality 
of monomials 

a r 0 0 a[ l * • • a r n n (r 0 + r x + - • • + r n = r) 

of order r in these coefficients. A homogeneous polynomial 
I of order r in the is a linear combination of these monomials. 
We thus see that if I is of given degrees r, p, • • • in the coefficients 
of the arbitrary forms /,<£,*•* it is a linear combination of 
quantities which constitute the substratum of a definite re- 
presentation of c ; this representation is known as soon as we 
have given the orders n, v, • • • of the forms /,<£,••• in the 
variables rj and the degrees r, p } • • • of the invariant I in the 
arbitrary coefficients of /,</>,*• \ Discarding the all too special 
formal algebraic assumptions involved in the “ classical ” 
concept of an invariant, and which the theory of invariants has 
from the beginning attempted to outgrow by generalizations in 
various directions, we may express the concept in modern 
group-theoretic language as follows : 

Let $Q : s U(s) be a given representation of an abstract group 
g in an n- dimensional representation space 81 with variables % { ; 
a linear form in the is said to be an invariant in the representation 
space SW of § if it is unchanged under all the transformations U(s). 
If I h / 2 , • • • are invariants in the representation space of 
then any linear combination + a 2 / 2 + • • • of them with 
constant coefficients a lf a 2 , • • • is also an invariant. The most 
important problem arising here is naturally that concerning the 
number m of linearly independent invariants in the given 
representation space. If y l9 y 2 , * • * y m constitute such a com- 
plete set of linearly independent invariants, and if we choose as 
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co-ordinates in 9ft these m quantities and n — m further linear 
forms y m + 1} * • *, y n such that the two sets together constitute 
a complete system of linearly independent linear forms in 
the transformation U(s) is, in terms of the variables y, 

y\ = yi, • • •» y'm = Vm ; 

y'm+1 — 1 CO y\ + • ’ • + u m+l, n ( s ) Vn> 


y n = U n i (s) y x + • • • + u nn{ s ) y n* 

If we are dealing with a unitary representation the y's can be so 
chosen that they define a normal co-ordinate system ; <p is 
then reduced into m times the 1-dimensional identical repre- 
sentation y* = y and an (n — w)- dimensional representation. 
Hence the problem of finding the number of linearly independent 
invariants in the representation space 91 reduces to finding how 
often the identical representation with the character 1 is con- 
tained in the given But by formula (11.8) the solution of 
this problem is given by 

m = W{x(s)}, (14.1) 

or : the mean value of the character x of which is always a 
non-negative integer , gives the number of linearly independent 
invariants in the representation space of 

The formula (14.1) answers the principal question arising 
in the linear invariant theory, and we now proceed to an ex- 
tremely brief discussion of the algebraic invariant theory. Let 
©, $, • • • be representations of the same abstract group g in 
the spaces with variables x i} y k , • • \ We consider rational 
integral functions I(x it y k , • • •) which are homogeneous in the 
variables x i} homogeneous in the variables y kr etc. If on sub- 
jecting x, y, • • • to those linear transformations corresponding 
to the same arbitrary group element s in the representations 
©,$,•••/ remains unchanged, then it is said to be a rational 
integral invariant of the system [©, §,•••] of representations . 
If the orders p y q, • • ; of the function I in the variables x it y k} • * • 
are given, the problem reduces to the one discussed above ; 
for the monomials in these variables which are homogeneous 
of order p in the x {) homogeneous of order q in the y k} • • • con- 
stitute the substratum of a representation obtained in a certain 
way from ©, • • •. But if we consider simultaneously in- 
variants of all possible orders belonging to the system [©, * • *] 

we are confronted with new problems. The most important of 
these, which is answered in the affirmative by the so-called 
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fundamental theorem of the theory of invariants is : Do there 
exist a finite number of invariants such that all others can be 
expressed rationally and integrally in terms of them ? This 
involves the question of algebraic, rather than linear, dependence 
between the invariants. We only mention this higher branch 
of the theory of invariants, and do not go into it further, as it 
bears no direct relation to quantum mechanics. 10 

In addition to invariants or scalars, covariant linear 
quantities , such as vectors and tensors, play an important 
role in physics. Let g be the group of all linear transformations 
between the normal co-ordinate systems in space or in space- 
time, e.g. the 3-dimensional group of Euclidean rotations or 
the group of Lorentz transformations, and let <p:s-> U(s) be 
an n-dimensional representation of g. A covariant quantity of 
kind § is an entity having n components a 1} a 2 , * * •, a n relative 
to any given co-ordinate system for the variables of the transforma- 
tion group g and which is such that on going over to a new co- 
ordinate system by means of the transformation $ of g the new 
components a € are obtained from the old by the corresponding 
transformation U(s) of ip. If ip is irreducible such a quantity 
is said to be primitive or simple . Physical quantities are generally 
simple . Thus, for example, the entity whose components are 
the electro-magnetic field strengths in the 4-dimensional world 
is described as an “ anti-symmetric tensor of order 2 ” rather 
than merely as a “ tensor of order 2 ” ; we shall see in Chap. V, 
§ 4, that it is therefore a simple quantity. The reduction of 
a representation into its irreducible constituents implies the 
reduction of the corresponding kind of quantities into simple 
quantities. It would appear that the only simple quantities 
with which we deal are tensors which are characterized by 
certain symmetry conditions in addition to their order. We 
shall prove this theorem for the complete linear group c and for 
its unitary sub-group it in Chap. V ; it asserts that all repre- 
sentations of c (or it) can be obtained by reduction from the 
powers c, (c) 2 , (c) 8 , * * * and that the irreducible constituents 
of (c)/ are obtained by imposing certain symmetry conditions. 

We must accordingly generalize the problem of the linear 
theory of invariants in the following manner. Consider two 
unitary representations f) : cr -> s, ip : a S of the abstract 
group g with elements a ; let their dimensionalities be n , N 
and let f) be irreducible. We wish to determine all covariant 
quantities of kind f) in the representation space of ip. Calling the 
variables in this representation space x { , which undergo the 
transformation S under the influence of cr, such a quantity 
I has n components I lt / 2 , •••,/,» which are linearly independent 
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linear forms in the variables x { . When the x { undergo the trans- 
formation S the n linear forms go over into new ones which 
are obtained from the (in which the variables x { have been 
transformed in accordance with S) by means of the transforma- 
tion s of f). If there exist two or more covariant quantities 

i = (a, 4 • * •, a), /' = (/;, 4- • •, a • * • 


of the kind f) in the representation space of then any linear 
combination a / + a'/' + * • • with constant coefficients a is 
again a quantity of the same kind. We ask for the number m 
of linearly independent quantities of this kind. The answer is 
that m is equal to the number of times the irreducible representation 
f) is contained in Hence if x, X are the characters of f), ip, 
we have 


m = 9R{X(«}. (14.2) 

In order to prove this statement we choose the co-ordinate 
system x { in the representation space of § in such a way that 
the matrices of § are reduced into their irreducible constituent 
sub-matrices, the m representations f) : f)' = f)" = • • * = ljM = f) 
being separated out first. The remaining constituents 
• • • are inequivalent to f). Denote the variables in the corre- 
sponding invariant sub-spaces by 






; A m \ 


•* * 


t») . . . . 

n y 


The matrix S is completely reduced into the sub-matrices 
s' = s, • • •, s< m > = .s ; • • • arranged along the principal 

diagonal. Let 

Vi — a u x i + ’ ‘ * + & 1 N 1 

Vn — a nl x i + ■ ’ • + &nN X N< 

be a covariant quantity of the kind f). We can write this in the 
form y = Ax in terms of the column x of the N variables x it 
the column y of the n variables y a and the matrix A — |la«J|. 
The requirement that I be a quantity of kind f> means that 
when x is replaced by x' = Sx, y goes over into y' = sy, or 

sy = ASx, sAx = ASx, sA = AS. (14.3) 

Corresponding to the reduction of #-space into irreducible 
sub-spaces, the matrix A of the correspondence of a;-space on 
y-space is reduced into matrices A', • • A (m) ; A < m * » ■ • • 
consisting of the first n rows, • • •, the m tb set of n rows, ’••• 

' • ' of A. Equation (14.3) then becomes 

sA' = A's, • • •, sA( m) = A (m) s ; sA (m ‘ h = . . . 
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? f0l l°w h A' m f “ n ? amental theorem (10.5) on representa- 
*°> S afriv 4 -u t- are a ^ multiples of the ^-dimensional 
p“J * * l ? that the remaining A<™ +1 \ • • • are all zero. 
But this is just our assertion that y = (y, v , ... v 1 is a 
linear combination of the m quantities ^ V ’ ™ 

x ’ = 4 , * • 4 ), 


of the kind f). 


r ( m ) . 


( 4 m) , 4 m) , • • •, 4 m) ) 


§ 15. Remarks on Lie’s Theory of Continuous Groups 
of Transformations 

In § 12 we made use of the concept of infinitesimal elements 
oj a group m order to establish a method of measuring volume 
on a continuous group manifold. We here discuss this concept 
in detail for the 3-dimensional group b of rotations in Euclidean 
space. 11 This group serves to describe the mobility of a body 
m Euclidean space, one point O of which is fixed in space. Each 
possible position of the body can be considered as arising from 
any given initial position by an operation of b. A material 
substance distributed throughout the space or any portion of 
it moves as a rigid body about 0 if the position of each of its 
elements at a given moment is associated with its initial position 
by means of a correspondence belonging to b. This is the 
description of the motion of such a rigid body which compares 
the position in any moment directly with the initial position, 
ignoring the intermediate states which it has assumed in going 
from the one into the other. But it seems more natural to 
consider it in terms of a continuous motion in which the position 
of the body undergoes an infinitesimal rotation from moment 
to moment, so that the motion as a whole is the integration 
of a series of infinitesimal operations of b. On employing an 
auxiliary variable t in order to avoid the use of i nfini tesimals 
and thinking of this parameter as time, the velocity field 
dx — x, dy = y, dz — z of an infinitesimal rotation is defined 
by [cf. I, § 6] 

dx = bz — cy , dy = cx — az, dz — ay — bx, (15.1) 

where the constants a, b, c are independent of position (x, y, z). 
These velocity fields, which obviously constitute a 3-dimensional 
linear manifold, are the infinitesimal elements of b ; they are 
the “ vectors ” which define the linear space tangent to the group 
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manifold at the point which represents the unit element I. 
The continuous motion of a rigid body about 0 is characterized 
by the fact that at each moment its velocity field belongs to 
the 3-parameter linear family (15.1). We may take as a basis 
of this family the three elements D X) D 1Jf D z obtained by choosing 

« = 1, 6 = 0, c = 0) a = 0,b = l, r = 0 ; a = 0, b = 0, c = 1. 

We call these “ the infinitesimal rotations about the x -, y- and 
s-axes.” S . Lie was the first to undertake a systematic study 
of the construction of transformation groups from their in- 
finitesimal elements. . In fact, once they are known all the 
substitutions of the continuous group can be generated by 
integration, i.e. by successive application of such infinitesimal 
elements — at least, all those which belong to the same connected 
“ sheet ” as the identity. (Example : the proper orthogonal 
transformations can be obtained from the infinitesimal ones, 
but not the improper transformations with determinant — 1). 

In general, consider a continuous r-parameter transformation 
group ©, and let the group manifold be described in terms of 
the parameters s ti s 2 , • • •, s r in the neighbourhood of the unit 
point, at which’ they vanish. A portion of the group manifold 
is thereby mapped in a one-to-one continuous manner on a 
neighbourhood of the origin in the r-dimensional number space 
of the parameters s. Let the n- dimensional point-field of the 
transformations be described in terms of co-ordinates x lf x 2 , • • •, x n 
in the neighbourhood of the point under consideration, and let 
the correspondence x x' : 

%i = <{>{{% i %2 * * ‘ %n > ^1 > * * *> $r) 

be associated with the element (^, s Zl • • •, s f ) of the abstract 
group in its realization by the transformation group. The 
infinitesimal transformation x -> x + dx obtained by assigning 
the infinitesimal increments ds to the parameters s in the neigh- 
bourhood of s — 0 is given by 



the parentheses indicate that the differential quotients are to 
be computed for s 1 = 0, • • •, s r = 0. We postulate a material 
substance which fills the point-field and which is capable of 
executing those and only those motions in which the positions 
of its elements at an arbitrary moment t f are obtained from their 
positions at time t by a transformation of ©. Again its motion 
can be more simply described as the result of successive deforma- 


LIE’S THEORY OF CONTINUOUS GROUPS 177 

tions corresponding to infinitesimal operations (15.2) of our 
group ; the velocity field must at any time have the form 



where cr lt * * *, 07 are constants independent of position. This 
r-dimensional linear family constitutes the infinitesimal group of 
motions of our substance. It is to be observed that the application 
of these infinitesimal processes to our transformation group 
presupposes that the functions </> a - are differentiable with respect 
to ^ at the point $ = 0. In the theory of abstract groups the 
point-field is the group manifold itself and we take as a realization 
(left-) translation. In the neighbourhood of the unit element 
s = 0, t = 0 we have, as law of composition, ' 

{st) a = ; h • * t r ) [a = 1, • ■ *, r]. 

The introduction of a measure of volume in § 12 presupposes 
that the functions ijj a are, for sufficiently small t, differentiable 
with respect to the s at the point s = 0 } and that for sufficiently 
small 5 they are differentiable with respect to t at t = 0. 

The composition of infinitesimal elements of the group is 
expressed by addition of the parameters a introduced by (15.3). 
It might therefore appear as if the infinitesimal elements of an 
r-parameter continuous group need satisfy no condition other 
than that they constitute a linear family. However, that is 
not the case; there are farther “ integr ability conditions ” to 
be satisfied. The example of a sphere which rolls without 
slipping on a horizontal table shows that the possible positions 
of a body whose infinitesimal motions have but three degrees 
of freedom can nevertheless constitute a 5-dimensional manifold. 
The integrability conditions we are seeking, which involve 
second order derivatives, guarantee that this situation does not 
arise. We obtain these conditions on expressing the fact that 
the commutator of two infinitesimal elements 5 , t of the 

group also is an element of the group. This commutator con- 
verges to I as s approaches the unit element I, whatever t may 
be, and similarly as t l for arbitrary * The commutator of 
the two infinitesimal linear correspondences A and B : 

dx = Ax , d'x = Bx 

is the infinitesimal correspondence AB — BA ; to show this 
we note that the equation 

A(S)B (t) » r(s } t)B(t)A(s) 
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leads, on writing 

/d Bn 

' (-0 


m writing 

A(0) - 1, B{0) - 1, (f ) - (f 


lim 

s,t->0 

to the equation 


r(s, t) - i ( d?r _ s = 


S • t 

C = AB 


\ds dt) 8 ,t 
BA. 


C, 


Our main purpose in mentioning these matters is to prepare 
the ground for an understanding from general principles of ^ the 
commutation rules satisfied by the three infinitesimal rotations 


Dyi D z : 


0 0 0 

0 0-1 


0 0 1 

0 0 0 

1 

0 

1 

-1 0 

0 0 

0 1 0 


-10 0 


0 

0 0 


(15.4) 


They are, as is readily shown, 

D x D y — D V D X = D z , D y D z -D z D y = D X) 


D Z D X 


D X D Z Dy . 


} (15.5) 


We could, of course, take the unimodular unitary group xt a 
in two dimensions as fundamental, instead of the group b 3 of 
rotations. We denote the two variables which undergo the 
transformations a of the unitary group by f, v] as in § 8. In 
consequence of the correspondence a -> s y which was established 
there by means of a stereographic projection, the 3-dimensional 
rotation group now appears as a representation of U*. We can 
take as a basis for the 3-parameter linear manifold of infinitesimal 
operators of U 2 the three particular operators — 



d * = Yi 71 ’ dv) = 

^ v, d v— \ £ ; ► 

#= d v =-~ v] 


(15.6) 


here, in agreement with (8.15), 


0 

1 

II 

0 

1 

, £ z = 

1 

0 

1 

0 

II* 0| 


0 

-1 


They are the infinitesimal transformations of u 2 corresponding 
to the three infinitesimal transformations D xy D y) D z of b 3 in 
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virtue of the correspondence a 5 ; that this is in fact the case 
is readily seen from (8.10) or 

* f* + y 2 , y ~ l(£ 2 + y 2 ), 2 ~ 2 fa. 

Given any representation § : a U{o) of u„ its infinitesimal 
operators with matrices 

\ (M x , My, M z ) 

corresponding to the infinitesimal operators (15.6) in u 2 
satisfy the same equations (15.5) as the D x , D y , D,: 

M x My - M y M x = iM z , • • .. (15.7) 

The matrices M x , M y , M z are of course Hermitian. For reasons 
which will appear in the following chapter we call these the 
components of moment of momentum (or angular momentum) 3)1 
of the representation §, and 

SO? 2 = M 2 = Ml + Ml + M; 

the square of the magnitude of the moment of momentum. If 
are two representations with angular momenta 3)1, 3ft' 
■ then, in accordance with the general formula II, (10.4), which 
governs the composition of infinitesimal operators by X -multi- 
plication, the representation § X §' has as moment of momentum 
(3ft X 1) + (1 X 3)1'). 

We next calculate the moment of momentum 3ft* of the 
irreducible representation (£/=$* (j — // 2) of u 2 . It will be 

found more convenient to employ in place of w-.S„, the 

2 1 2i J 

transformations 

a (S x + iS y ) : d£ — y, dy — 0 

. (15.8) 

± d$ = 0, dy = £ 

In general 

d(£ r y a ) = r £ r_1 y‘ d£ + s y‘~ l dy, 
and on substituting in this the variables 

x(m ) — ■ ~JL= (r + s = 2j, r — s = 2 m) 

Vr! j! 
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of the representation space of we find that the three infinitesi- 
mal transformations of u 2 defined by (15.6), (15.8) induce in this 
space the transformations 

\ ( S x + iSy) : dx(m ) = Vr(s + f) x(m - 1) 

= V(; + m){j — m + 1) x(m — 1), 

\ (5, - iS y ) : dx(m) = Vs(r + 1) x(tn + 1) 

= V(J — m*K7 + m + 1) *(»» + 1), 

S - : dx(m) = r - #(m) = m #(?ra). 

Hence 

(M x + iM 3 )(m, m — 1) = V(i + m)(j — m + 1), 

(M x - m + 1) = V(/ - m){j + w + 1), [ (15.9) 

M 2 (ra, m ) = m. 

All other components ( m , m!) vanish. M 2 is a multiple of the 
unit matrix in 9ft* : 

M 2 = j(j + 1 ), 

for it follows from 

(M x + iM v )[M x - *M.) = + Ml - i(MX - M V M X ) 

= M| + M| + M t 

that 

M* = (M x + iM v )(M x - tM.) -M, + Ml 
and from this and (15.9) that 

M 2 (m, m) = (j + m)(j — m + 1) — m + m 2 = j(j + 1 ). 

If on reducing an arbitrary representation § the irreducible 
representation is found to occur exactly g f times, then M* 
has ](j + 1) as a [(2 j + l)gi]-fold characteristic number and 
M z has the characteristic number m with multiplicity 

Zgi {j = M> \m\ + 1, • •). 

3 

From this we again see that the multiplicity g , with which %, 
occurs in the reduction of § is uniquely determined by £>. 
These infinitesimal operations can be used to give a relatively 
elementary constructive proof of the fact that the 3), are the only 
irreducible representation of % ia 

§ 16. Representation by Rotations of Ray Space 

In quantum theory the representations take place in system 
space ; but this is to be considered as a ray rather than a vector 
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space, for a pure state is represented by a ray rather than a 
vector. Two unitary transformations U and sU which differ 
only by a numerical factor s of absolute magnitude 1 are con- 
sequently to be considered as the same, £/~ef7, for they 
determine the same rotation of the ray field. In a “ ray repre- 
sentation which associates with each element s of the abstract 
group g a unitary rotation U(s) of the rays of n-dimensional 
representation space, the gauge factor e(s) may be taken 
arbitrarily for each unitary matrix U(s) ; if g is a continuous 
group we choose it, however, in such a way that U(s) depends 
continuously on The condition for a representation is now 
only 

U{s)U(t)~U(st), (16.1) 

U($)U(t) = 8( s , t)U(st) ) (16.2) 


where 8(s, t) is a numerical factor, of modulus 1, depending on 
5 and t. If by change of gauge U(s) is replaced by s(s)U(s), 
8 (s, t) is replaced by 

e(st)e-'(s)e-'(t)8(s, t). 

In the equation 


X~2x(s)U(s), 

8 


defining the connection between the components x[s) of an 
element x of the algebra of the group and the group matrix X 
which represents it, the x(s) are also dependent on the gauge 
and are sent into e(s)#(.y) on the change of gauge defined by 
U(s)-*e(s)U(s ). In order that the multiplication law for two 
elements x , y shall, as we require, parallel the multiplication of 
the matrices which represent them we must define 

*y(s) = 2:3(<,O*(0y(O (i«-3) 

tt'^s 

in terms of the chosen gauge. The condition 

x(s~ l ) = x(s) 

for a real element x is only appropriate if the gauge is so chosen 
that U ($~~ l ) is the matrix reciprocal to U(s). The algebra of 
the group is to be adapted in this way to the ray representation 
under consideration, whereas in dealing with “ vector repre- 
sentations ” it is uniquely determined by the law of composition 
of the group alone. 13 

Examples. 

I. The 1 -dimensional representations are now entirely 
uninteresting, for any 1-dimensional matrix ~1. But under 
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certain circumstances Abelian groups may possess multi- dimen- 
sional unitary ray representations , whereas any irreducible 
unitary vector representation of an Abelian group is necessarily 
of degree 1. 

We first investigate the simplest example, a finite cyclical 
group (i a ) of order h } consisting of the elements 

1, a , a 2 , • • •, a h ~ l (a h = 1). 

Let the element a correspond to the unitary matrix A in the 
ray representation ; then A h = ocl is necessarily a multiple of 
the unit matrix. Since a is of modulus 1 we may change the 
gauge in such a way that A goes into A/V a; then A h = 1 
and the correspondence a h -> A k is a vector representation of 
the cyclical group. Hence by introducing an appropriate 
change of gauge the ray representation can be made into a 
vector representation, 8(5, t) being then 1. 

II. The simplest example of an Abelian group which gives 
rise to multi- dimensional irreducible ray representations must 
consequently be non-cyclic. Consider the group consisting of 
the four elements I, a, b, c with the multiplication table 


a 2 = b 2 = c 2 = 1, 

be = cb — a , ca ~ ac ~ b , ab = ba = c. 
A ray representation SS is given by 


(16.4) 


U( I) 


1 0 
0 1 


U(a) 


0 1 
1 0 ’ 


U(b) = 


0 

i 



The normalization is here chosen in such a way that 
lP(a) = U[a)U{ar 1 ) = 1 


1 0 
0 -1 
( 16 . 6 ) 


and similarly for 1, b, c. The algebra defined by (16.3) for. this 
representation is non-commutative in spite of the Abelian 
nature of the group ; it is the algebra of complex quaternions. 
On denoting the elements of this algebra by 


x — * 1 + + 

the units I, a, b, c have the same multiplication table as 
the corresponding matrices U : 



1 

a 

b 

c 

1 

1 

a 

b 

c 

a 

a 

1 

ic 

— ib 

b 

b 

— ic 

1 

ia 

c 

c 

ib 

— fa 

l 


(The product xy occupies 
the intersection of the 
row x with the column y.) 
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The “ real ” quantities are those for which all components 
k, A, /x, v are real. Since in the calculus of quaternions 1, ia, 
ib } ic are taken as the fundamental units, they are those whose 
scalar component k is real and whose vectorial components 
Xji, fi\i , v/i are purely imaginary. 

III. The group u = u 2 of unitary transformations a in two 
dimensions with determinant 1. Consider a representation 
a U(o) by rotations in n-dimensional ray space. On changing 
the gauge in such a way that U(o) goes into 

U (a) : S/det [7(a), (16.6) 

the determinant of the new U(o) is 1. The only possible diffi- 
culty consists in the fact that the n th root 

e(a) = S/det Ujo) (16.7) 

is multiple-valued. It is “ locally ” single-valued, i.e. if we 

have chosen a definite one e 0 of the n values for the point 

& = a 0 , we can uniquely determine the root e(o) in a sufficiently 
small neighbourhood of a 0 in such a way that it depends con- 
tinuously on a and goes over into e 0 for a — cr 0 . Hence we can 
continue the determination of the root for a = <r 0 in a unique 
manner along a path in the group manifold, starting in a 0 . 
The only question is whether 6 (a) returns to its original value 
when we allow a to describe a closed path. This is to be answered 
in the affirmative ) since the group manifold of it is simply connected 
in the sense that any closed curve can be drawn together into 
a point by a continuous deformation. For in accordance with 
equation (7.5) the elements of the group are mapped in a one- 
to-one continuous manner on the quadruple (k X/iv) of real numbers 
which are subject to the condition 

K 2 4“ A 2 + [X 2 + V 2 = 1. 

Hence the group manifold has the same topological properties 
as a 3-dimensional sphere in 4-dimensional space. These con- 
siderations thus show that the n th root (16.7) is broken up into 
n single-valued continuous functions over the entire group 
manifold. The method of proof here employed, which is of 
fundamental importance in the whole of mathematics, is perhaps 
best known to the reader in the proof of Cauchy s integral 
theorem ; it follows from the fact that the integral of an analytic 
function is locally single-valued, that it is single -valued in the 
large if the region in which we are operating is simply connected. 

The result of our topological considerations showed that 
the formula (16.6) defines n single-valued continuous functions 
L/(a). One of them is such that in it U{ I) is the unit matrix ; 
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we henceforth denote it alone by U{a). On writing the equation 

U (a) U (t) = 8 (tr, t)U(ot) (16.8) 

for r= I, and taking into account the fact that U(\) — 1, we 
find h(s, I) = 1. On forming the determinant of both sides of 
(16.8) we obtain the equation 

1 = [8(<r, r)]». 

8(a, r) is consequently an « th root of unity which depends con- 
tinuously on t for fixed a and which reduces to 1 for t = I ; 
hence it is identically equal to 1, and (16-8) becomes 

U(<j)U(t) = U (or) . 

Consequently the only ray representations of U 2 are also vector 
representations , and our considerations show that this theorem is 
valid for any continuous group whose elements constitute a simply 
connected manifold. On going over to the 3-dimensional rotation 
group b 3 by stereographic projection, all even those with 
half-integral j, are single-valued when considered as ray repre- 
sentations. Any single-valued continuous ray representation of 
b a is reducible into irreducible constituents, and the only irre- 
ducible ray representations are the (j = 0, 1/2, 1, 3/2, • - •) 
obtained earlier in the chapter. But b 3 is not simply connected ; 
we must resort to a two-sheeted covering surface, similar to 
a Riemannian surface but without cuts or branch points, which 
is simply connected. This accounts for the fact that there 
exist irreducible ray representations of b 3 which may be single- 
ox double-valued vector representations, but there cannot exist 
multiple- valued representations of higher degree. 

. 1 have been able to prove the same theorem for the n-dimen- 
sional rotation group ( n ^ 3). 11 This means that there exist 
two closed continuous motions (i.e. motions which lead back 
to the initial state) of a rigid body, which is free to rotate about 
a fixed point 0, such that any other closed motion can be con- 
tinuously deformed into one of the two. One of these may be 
taken as rest, and the other is such that it cannot be continuously 
deformed into rest- J 


CHAPTER IV 


APPLICATION OF THE THEORY OF GROUPS 
TO QUANTUM MECHANICS 

A. The Rotation Group 

§ l. The Representation Induced in System Space by 
the Rotation Group 

I N accordance with III, § 8, we can interpret the theory of 
a single electron in a spherically symmetric electrostatic field, 
as developed in II, § 5, in the following manner. A rotation 
of physical space, i.e. an orthogonal transformation from the 
Cartesian co-ordinates xyz into x'y'z', induces a unitary trans- 
formation U(s) : iff -> defined by 

= ip(xyz) (1.1) 

in the system-space 91 of the electron, the vectors of which are 
the wave functions *Jj(xyz) describing the state of the electron. 
The correspondence U(s) is a definite representation (S, of 
infinitely many dimensions, of the rotation group b 3 . This 
representation 6 can be reduced into its irreducible constituents 
and it is found that each < £) l with integral l occurs an infinite 
number of times. The total system-space 91 is correspondingly 
decomposed into mutually orthogonal sub-spaces 91 (nl) ; 91 (nl) 
has 21 +• 1 dimensions and the rotation group induces the 
representation in it. If we introduce in addition the im- 
proper rotations (bg) always appears in © with the signature 
(•— 1)K The oo -dimensional sub-spaces l(nl) associated with 

n 

the various values of l are uniquely determined, but their further 
decomposition into the summands is quite arbitrary. In 

particular, this can be done in such a way that the energy of 
the states composing 91 (nl) has a definite value E(nl). 

We now calculate the operators induced in system-space 
by the infinitesimal ' rotations of physical space. Denoting the 
increase ifr'(xyz) — 'p(xyz) by dtp, equation (1.1) becomes 

# + + | = 0 
185 
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for the infinitesimal rotation 5* which sends 

x , y, z into %' = x + d%, y r = y + dy, z f = z + dz. 

Taking as s the three infinitesimal rotations D K) D yi D z in turn 
[III, (15.4)] and writing the corresponding infinitesimal unitary 
operators in the form 

di/j = j(L Xf Ly, L z )*f r, 


we find 


L x 


-My- 

iV7)z 



( 1 . 2 ) 


hS> is accordingly the moment of momentum [cf. II, (4.9)]. 

On going over from one electron to two, the vectors of system 
space are the functions ^(x^y^ ; x^^) of the Cartesian co- 
ordinates of both electrons. The unitary transformation 
U : */j -> i/j r induced in system-space by the rotation $ is now 
defined by the equation 

f ; 4>44) = ; x. 2 y 2 s 2 ), 


where x[y[z[ and x r $ 2 4 are obtained from x 1 y 1 z 1 and x^y^z 2 
by the same orthogonal transformation This situation can 
be described as follows : The state space 3 ? 2 of the system con- 
sisting of two electrons is SR X 9? and the representation @ a 
induced in it is 6 X 

This representation is, as we see, determined by the kine- 
matical constitution of the system alone, and is in no way 
influenced by the dynamical relationships ; the rule for X - 
multiplication for the induced representation on composition 
of partial systems presupposes only kinematical, not dynamical, 
independence of the partial systems. 

We can, without further trouble, formulate the situation 
discussed above in terms of the general scheme of quantum 
mechanics in a manner which is independent of the particular 
assumptions of Schrodinger's scalar wave theory. This is all 
the more important since it has all along seemed doubtful 
whether the matter waves could be described in terms of a 
single state function \jj. We set up an analogy between the actual 
displacement of the state of the system in time and the virtual 
change produced by an arbitrary rotation of space. The 
transition from time t to time t r changes the (arbitrary) state 
J at time t into a state at time t r } obtained from j by a unitary 
transformation U corresponding to a displacement of the time 
axis which sends t over into t r . The displacements along the 
time axis constitute a one-parameter continuous group which is 


tk 
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isomorphic with the group of transformations JJ associated 
with them in system-space. The former group is generated 
from the infinitesimal displacement t -> t + dt, and it therefore 
suffices to give the infinitesimal unitary operator 

associated with it in system-space. We called the Hermitian 
operator H the energy . 

On subjecting the physical system (or the spatial co-ordinate 
system in terms of which it is described) to a virtual rotation s } 
the state j: goes over into another state Since nothing 
intrinsic to the system is changed thereby and since the state 
space 9ft is linear and unitary, the transition U(s) : J -> j' 
associated with s must also be linear and unitary. As in the 
case of the group of actual displacements in time, this group 
of virtual rotations in space must induce a certain representation 

in the system-space SR ; this latter is more properly to be 
considered as a ray, rather than a vector, space. But if we go 
over from the rotation group to the unimodular unitary group 
u 2 (or 1 X 2 ) by stereographic projection (III, § 8) and take this 
latter as fundamental, it is, in accordance with III, § 16, not 
necessary to distinguish between ray and vector representations. 
The group of proper rotations can be generated from its infini- 
tesimal operations, and we may take as a basis for these the 
infinitesimal rotations D X} D y , D z about the x -, y-, and 2 -axis. 
It then suffices to know the infinitesimal unitary transformations 

di = l(M x , My, M x )i 

which they induce in system space. We call the real physical 
quantities of the system which are represented by the Hermitian 
operators M m M jh M z the x y~, 0 -components of the moment 
of momentum 3Jt. In order to express them in terms of the 
usual units they must, as was also the case with the energy, 
be multiplied by the quantum of action h. The moment of 
momentum plays the same rdle with respect to the virtual rotations 
of space as the energy with respect to the actual displacements in 
time , 

One argument for the appropriateness of our definition of mo- 
ment of momentum is that in the case of the Schrodinger theory 
it leads to the usual formulae of classical mechanics. As a further 
justification we prove the general theorem that the moment of 
momentum so defined is constant in time. We saw in II, § 8, 
that the necessary and sufficient condition that the physical 
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quantity represented by the Hermitian operator A be constant 
in time was that A commute with the Hermitian operator H 
induced by the infinitesimal displacement of time. In exactly 
the same way we can show that the commutativity of A with 
M X) M y) M z constitutes the necessary and sufficient condition 
that the quantity represented by A remains unaltered under the 
virtual proper rotations of space, i.e. that A is a scalar with 
respect to these rotations. Now the energy is a scalar, hence 

HM X — M X H = 0, • • .. 

But, on the other hand, these equations assert that M Xl M yy M z 
are constant in time. 

The infinitesimal rotations generate only the group of proper 
rotations ; in order to obtain the complete orthogonal group we 
must supplement them with the reflection i in the origin, or 
extend the group u 2 to the group u 2 by the addition of the ele- 
ment l (III, § 8). l will induce a unitary operator I in system 
space which commutes with all U(s), in particular with the 
moment of momentum 3Jt = (M Xi M y , M z ), and which satisfies 
the equation II = 1 ; this shows that I is Hermitian, as well 
as unitary. A quantity A which is unchanged by reflection 
must commute with I ; hence, in particular, the energy H 
must commute with I . The physical quantity represented by /, 
which we call the signature , is constant in time, as it commutes 
with H . It has, in common with all quantities arising in group 
theory which are not associated with infinitesimal operators, 
no analogue in classical mechanics. 

We reduce the total system -space into invariant sub-spaces 
with respect to the group of displacements in time ; such an 
invariant sub-space is carried over into itself by the generating 

infinitesimal operation cty = if/j. Since we are here dealing 

with a one-parameter Abelian group, or with a single operator H , 
this reduction can be carried to the point in which all the con- 
stituent sub-spaces are 1-dimensional. The states contained in 
one of these invariant sub-spaces we call quantum states . 

We now proceed in exactly the same manner to reduce the 
representation 91 induced in system-space by the group of rota- 
tions into its irreducible constituents $),. We make use of the 
fact that these are known to us a priori ; only the number of 
times they appear in 91 depends on the particular representation 
91. (Of course, we have not as yet shown that the % really 
constitute a complete system of irreducible representations of 
b 3 , and it may seem risky to apply the process of reduction to 
the oo -dimensional representation 9Z. This procedure can, 
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however, be justified on the basis of the fact that b 3 is a closed 
group. But in the final formulation of quantum mechanics it 
will not be necessary to base our conclusions on such general 
considerations, as the reduction into 2); will be obtained by 
elementary means.) The entire system-space SR is thus decom- 
posed into sub-spaces SR;, SR ; -,, • * • such that SR; is of dimension- 
ality 2 j + 1 and the representation induced in it by the group 
U 2 is 2);. On adapting the co-ordinate system in system-space 
to this decomposition the variables fall into classes 

x(m) ( m = j, j — 1, • • — j ) ; 

x'(m') ( m ' = j\ j’ - 1, ; • • • ; 

under the influence of an arbitrary transformation o of u 2 , 
applied to the variables £, rj the co-ordinates of system-space 
transform in accordance with the law 

x(tn) 7 =r= t (i + k = 2/, i — k = 2m). 

yi ! k ! 

With the reduction of SR or SR is associated the reduction of the 
angular momentum SIR ; in the sub-space SR; the components 
of SIR are given by III, (15.9), from which it follows that the 
square M 2 of the moment of momentum has there the fixed 
value j(j + 1). (It is evident from general considerations 
that M 2 must be a multiple of the unit matrix in SR;, for it is 
a scalar and must therefore commute with all the operators of 
the irreducible representation 2);.) If the state of the system is 
represented by a vector lying in SR;, the ^-component of its 
moment of momentum is capable of assuming the values m = /, 
j — 1, ; the ^-component naturally only apparently 

occupies a preferred status, due to the fact that the co-ordinates 
in SR; were chosen in a manner which differentiated the z-axes 
from the others. That M Z) M 2 can a priori assume only discrete 
values m , j(j + I) is essentially due to the fact that the rotation 
group is closed ; since the group of displacements in time is open, 
the analogous result for the energy need not in general hold. 
In this connection we wish to emphasize again that the operator 
H depends on the dynamical relationships existing in the system,, 
whereas the representation $R induced by the group of rotations 
is determined only by the kinematical situation (number of 
elementary particles, etc.). The signature I also assumes a 
definite one of its values ± 1 in each sub-space SR;. For lack 
of a better name we call the states which lie in the sub- 
space SR;, which is invariant under the group of rotations, 
44 simple ” states of inner quantum number j . We must 
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be prepared to find that j may here assume half-integral as well 
as integral values, in contrast with the Schrodinger theory* 

On uniting two kinematically independent systems, with 
system-spaces 91, SR' in which the rotation group induces the 
representations 91, 91', the total system has as system -space 
91 X 91', in which the representation 91 X 9T is induced. In 
particular, the moment of momentum of the total system is 

(3ft X 1) + (1 x W) 

where 9ft and 9ft' are the angular momenta of the two partial 
systems. The theorem that the moment of momentum behaves 
additively with respect to composition is contingent only on the 
assumption that the parts are kinematically independent, 
whereas the corresponding theorem for energy applies only if 
they are dynamically independent, i.e. in the absence of inter- 
action between the parts. This difference is based on the fact 
that whereas the energy represents that actual change of state 
in the course of time, the moment of momentum represents 
the virtual change associated with a fictitious rotation. We 
reduce 91, 9 V into the invariant irreducible sub-spaces 91,, 9^ 
respectively, i.e into the simple states of the two partial systems 
having inner quantum numbers, /, j'. The Clebsch-Gordan 
equation (III, § 5) 

X 5V = ® j tj'-i + • • • + (1.3) 

then tells us : If the two parts are in the simple states with inner 
quantum numbers j, j' then the whole has each of the simple states 
with inner quantum number 

J — i + f, j | j — j' | (1.4) 

associated with it f each exactly once . To include the signature 
we must add : If the parts have as signatures the values 8, 8' 
(8 = ± 1), the signature of the whole has the value 8 8'. 

Compare the results which we have obtained with the 
corresponding results in classical mechanics. In both the moment 
of momentum is constant in time and the moment of momentum 
of the whole is equal to the sum of the moments of momentum 
of the two parts. Denoting the magnitude of the moment of 
momentum in classical theory by j ) we have, in agreement with 

zjzj+f, 

for the resultant of two vectors of magnitudes j, j' is a vector 
whose magnitude J lies within these limits. Quantum mechanics 
demotes from classical mechanics in the following three respects .* 
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1. In quantum mechanics the square of the moment of momentum 
is j(j + 1), ™ classical mechanics it is j 2 ; 

2. Here j can assume only the discrete values 0, J, 1, f, • * *, 
there it may have any nonoiegative value ; 

3. Here the J obtained on compounding two partial systems 
can assume only those values between | j — j' j, j + f which differ 
from them by an integer , there it can assume any value between these 
limits . 

Already before the rise of the new quantum mechanics a 
semi-empirical description of the regularities observed in spectra 
had been given with the aid of a vector model consisting of the 
vectorial moments of momentum of the individual electrons 
and <ff the atom as a whole ; the observations, assisted by the 
older quantum mechanics, had already led to these three modi- 
fications of classical theory. 1 

The reader will perhaps have wondered why we consider 
only the virtual rotations of space and not the translations, 
which must also be taken into account in order to arrive at a 
complete description of the homogeneity of space. The reason 
for this is that in studying atoms or ions we treat only the 
electrons as particles, taking the nucleus as a fixed centre of 
force situated in the origin. That this is at least approximately 
correct is due to the fact that the mass of the nucleus is many 
times the mass of the electrons. Space is thereby transformed 
from a homogeneous into a centred space ; such a procedure 
naturally allows us to consider only atoms or ions, which have 
a single nucleus. Diatomic molecules are accordingly described 
with the aid of the 1-parameter group of rotations about the 
axis joining the two nuclei, and not by the full 3-parameter 
group of rotations of space — to this we must add reflection in 
the plane which bisects the axis perpendicularly in case the two 
nuclei are physically equivalent. 2 If we are dealing with three 
or more fixed nuclei the symmetry either disappears entirely or 
is reduced to at most a finite group of rotations. 3 


§ 2. Simple States and Term Analysis. Examples 

To each characteristic value E of the energy H there belongs 
a definite sub-space 5ft' of 5ft, the sub-space of quantum states 
with enemy level E' ; it consists of all states J which are trans 
formed Tnto • £ by the operator H and is accordingly the 
characteristic space SR(£') associated with the charactenstic 
value E' of H. Since the energy is a scalar, the considerations 
applied in the preceding paragraph to the total space SR can a 
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be applied to 9i' : 9t' is invariant under the operators induced 
in system-space by the rotation group and is consequently the 
carrier of a certain representation of this group, which can be 
reduced into its irreducible constituents. If the energy levels 
are of at most finite multiplicity we are faced with the problem 
of reducing only representations of finite degree. Accordingly 
91 is decomposed into the “ simple spaces ” Sft, associated with 
the rotation group in such a way that not only the square of 
the angular momentum and the signature have definite values 
in 9f>, but also the energy has a sharply defined value 1 Ej. This 
energy level E t is necessarily (2 j -f l)-fold degenerate ; we 
speak of an accidental degeneracy when the energy levels of 
different simple sub-spaces 9f* are equal. 7, M Z} M 2 and H 
are all simultaneously in diagonal form ; that this is possible 
is due to the fact that these four operators all commute among 
themselves. In this way the reduction into simple states can be 
employed in term analysis : each energy level E i possesses an 
inner quantum number j which gives the term the natural 
multiplicity 2; + 1. 

On subjecting the atom to a perturbing field which destroys 
its natural spherical symmetry this (2 j + l)-fold term is broken 
up into 2 1/ + 1 terms. Let the perturbation, i.e. its Hamiltonian 
function W, possess axial symmetry about the s- axis ; if Ej 
possesses no accidental degeneracy, then in accordance with the 
theory of perturbations the perturbed energy levels are given to 
a first approximation by the portion of the Hermitian operator 
W in which 9i* intersects itself : 

x(m) -> £W(m, m) x(m') [m = j — 1, • • •, — j ). 

The rotation about the s-axis with meridian angle <j> transforms 
x(m) into mf>) • x(m) t and in virtue of the symmetry assumed 
for W this correspondence of 91* on itself must also be represented 
by 

e(— m(j>) • x(m) = £W(m t m') • e{— m'<f>) x[m , ) ) 
or 

W(m, m') e[(m — m')<f > ] = W(m, m'). 

But this means that all elements W(m, m') except those in the 
main diagonal vanish, whence 

E, + W(m, m) (2.1) 

are the 2 j + 1 perturbed terms. The quantum number m , 
which is capable of assuming the values j — 1, ■ • •, — j t 
thus serves to label these components. Perhaps the most 
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important axially symmetric perturbation is that due to a 
homogeneous magnetic field in the direction of the 2 -axis 
(. Zeeman effect ) ; because of this m is called the magnetic 
quantum number. The inner quantum number j of a term 
can be determined spectroscopically by counting the number of 
terms appearing in the Zeeman effect. Sommerjeld first con- 
cluded, from the spectroscopic data, that j as well as m must be 
allowed to assume half-integral values. If we consider the 
Zeeman effect to be described by the analogue of the classical 
formula II, (12.5) then 

W = ^(m-koM„ 0-‘M, (2.2) 

and W is rigorously in diagonal form : 

W(m, m) = hom . (2.3) 

Our analysis shows that the breaking up of energy levels due 
to an axially symmetric perturbation parallels the reduction of 
an irreducible representation of the rotation group b 3 when this 
is restricted to the group b 2 of rotations about the 2 -axis : by 
this is reduced into the 2 j + 1 one- dimensional representations 
which we have previously denoted by ® (m) : 

x(m) e(— m<f>) * x(m). 

If two kinematically independent parts, which are in the 
simple states 5ft>, SftJ,, are compounded together, the state of 
the composite system is in the (2 \j + 1)(2/' + 1) -dimensional 
product space = 91,- X If the parts have the energies 

j E j) Ej, then the whole has the energy Ej + E' r , assuming no 
interaction between the parts. Introducing a weak interaction 
between the two partial systems and assuming that there is no 
accidental degeneracy, i.e. assuming that all the remaining 
energy levels of the unperturbed system are different from E^t, 
it suffices, to a first approximation, to consider the section 
<i/> of the energy operator H in which SW intersects itself ; 
it is an Hermitiarf correspondence of on itself. We can 
apply the considerations, which were applied above to the total 
system-space 5ft X 5R', to each of these 5ft;/' : 5R/// is to be de- 
composed into sub-spaces belonging to numerically distinct 
characteristic values of <H>. The rotation group induces a 
certain representation in each of these sub-spaces, and this 
can be further decomposed into its irreducible constituents. 
The result is that Sft/ X is, in accordance with the Clebsch- 
Gordan series, reduced into the simple spaces 5ft/, J = j + 
j y' — i f • . ^ |y — y'|, in such a way that in each of them 
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the energy <H> has a definite value Ej. Different Ej can only 
“ accidentally ” have the same numerical value. Consequently 
the term E }i > is broken up by the perturbation into terms Ej 
in exactly the same way as the representation x 2V is 
reduced into the irreducible representations But this is 

only correct to the approximation characteristic of perturbation 
theory. As we have seen above, an inner quantum number 
J can be rigorously ascribed to a term E ; in the approximation 
with which we have been dealing here there is associated with 
it in addition the inner quantum numbers ] ) j' of the parts, in 
the last analysis of the electrons themselves : the energy level 
E arises from a definite term of the unperturbed system by 
interaction of the two parts. Such an association is rigorously 
possible for “ simple states,” but the rules based on it lead only 
indirectly and approximately to an analysis of the terms. 4 

Examples 

If we take the Schrodinger scalar wave theory to be valid 
for a single electron, then a simple quantum state of the electron 
in the field of the nucleus is characterized by the principal 
quantum number n and the azimuthal quantum number l (we 
here use the word “azimuthal” instead of “inner”). Such 
a term is (2/ + l)-fold degenerate, and we assume there is no 
further accidental degeneration. The moment of momentum 
is represented by the operator £ taken over from classical 
theory ; the square of its absolute magnitude is 1(1 -j- 1) and 
the signature has the value ( — l) 1 . If / electrons come together 
to form an atom we obtain a term, neglecting interaction between 
the electrons, 

£(Vi) + E(n 2 l 2 ) + • • • + E{n f l f ) (2.4) 

of multiplicity (2^ + 1) • • • (2 1, + 1). The quantum numbers 
n and l refer to the individual electrons. The interaction causes 
a separation which parallels the complete reduction, obtained 
with the aid of the Clebsch-Gordan series, of , 

®i, X <$> 1 , X • • • X (2.5) 

into its irreducible constituents 3>x with total azimuthal quantum 
number L Each such term is associated with the quantum 
numbers 

( n i h, n 2 l 2 , • • •, riflf ; L). (2.6) 

If / ^ 3 certain % L appear more than once in (2.5), and we may 
therefore have several (2 L -j- l)-fold terms associated with the 
same set (2.6) ; these must then be distinguished from each 
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other by some further index. The square of the total moment 
of momentum is L(L -f- 1) and the signature ( — l)h + h+-- • +*y 
In spectroscopy it is usual to characterize the values Z = 0 1 2 3 
4, • • • by the small Latin letters 5, p, d,f, • • • and the values 
L = 0, 1, 2, 3, • by the corresponding capitals S, P , D, F, • • \ 
We cannot expect the scalar wave theory to be correct, 
but must be prepared to describe the state of the wave field 
in terms of a quantity i// with several, say a, components 
(fa, fa, ' • ■> fa), i- e - by a covariant quantity of a definite kind 
21. Each component is a function of the spatial co-ordinates 
xyz ; the components will depend on the choice of the Cartesian 
co-ordinate system in such a way that on going over to a new 
co-ordinate system by the rotation 5 the components will undergo 
among themselves that transformation which corresponds 
to 5 in the representation 31. Again, consider b 3 replaced by it 2 
as the fundamental group. The general component fa(xyz) of 
the “ vector ” </» has two indices, the index a running from 1 to a 
and the index (xyz) running through all the points of space. 
Let 9!( be the vector space of functions faxyz ) and 3L the 
a-dimensional vector space ; the state space of a single electron 
is then 92„ X 9ft(. Under the influence of the rotation s which 
sends xyz into x'y'z' the state i/t goes over into the state i p' 
defined by the equation 

fa(x'y'z') = 2X, , <A „(xyz), fl a„ fi f = A(s) ; 

P 


the representation induced in system-space is accordingly 
= 91 X The moment of momentum 9ft of the electron 
consists of two parts : 


STC = (@ X 1) + (1 X 8), (2.7) 

the first of which refers to the a-dimensional “ spin space ” 9ff a , 
the second to the “ translation space ” 91/. (1 x L x ), or simply 


L m is the operator l(y± - z~) 


which acts on each of the 


a components in the same way ; it affects only the index (xyz), 

leaving the index « unaltered. jS* is the unitary transformation 

corresponding to the infinitesimal rotation about the #-axis in 
the representation 21; (5* X 1 ), or simply S x , consequently 

affects only the index a and leaves (xyz) unchanged. Only 
the part 8 appears in classical mechanics ; we call it the orbital 
moment of momentum, and the remaining part <3 the spin 
moment of momentum, or simply the spin. Its appearance 
is unavoidable so long as the wave quantity i f> is not simply a 
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scalar or a set of scalars. Each of the two parts satisfies separ- 
ately the commutation rules III, (15.7), but in general only the 
total angular momentum satisfies the law of- conservation. If 
the quantity is of a simple kind, i.e. if 21 is an irreducible 
representation ® 5} then a = 2s + 1 and the spin © is equal to 
the moment of momentum 9JL associated with the representation 

Since the Schrftdinger theory has proved itself at least 
approximately correct, one should assume that to a first ap- 
proximation each of the components ifs a satisfies the Schr6dinger 
scalar wave equation. So long as we consider this approxima- 
tion, the a components have only the effect of multiplying the 
multiplicity of each energy level by a . But in reality the correct 
differential equations must contain a term, the “ spin per- 
turbation,” which introduces a coupling between the various 
components ifi x . The electron can thus be considered in 
abstracto as a composite system, consisting of the electron 
translation with system-space 31* and the electron spin 
with system-space 9t rt ; the spin perturbation is the weak inter- 
action between these two. Because of this the method of 
composition can here be applied. Let 21 = Decompose the 
translation space 91* into the (2/ + 1) -dimensional sub-spaces 
9t(nZ) ; the corresponding energy term E(nl) with azimuthal 
quantum number l has, on neglecting the spin perturbation, the 
multiplicity a(2l + 1) and its characteristic space is the space 
ffta X Sft(ttZ) of the same dimensionality. On taking the first 
order spin perturbation into account this term is separated 
into the terms E, with inner quantum number j and 'multiplicity 
(2 j 1) in a manner paralleling the decomposition of the repre- 

sentation X S) t into its irreducible constituents : 

X % = j = s + l, s + l- 1, • • •, |*- 4 (2.8) 

with the aid of the Clebsch-Gordan series. Care must be taken 
to differentiate sharply between the azimuthal and inner quantum 
numbers l and j. The latter is capable of assuming the values 
given in (2.8) ; whenever s the number of different terms in 
such a “ multiplet ” is 2s + 1. 1? is approximately equal to 

the constant 1(1 + 1), S 2 is approximately equal to the constant 
s(s + 1), and M 2 is rigorously constant and exactly equal to 
j(j -f- 1). We can thus speak of the azimuthal quantum number 
of an actual energy term only to within the approximation 
characteristic of perturbation theory. It is well to set forth 
these considerations beforehand and to approach the spectro- 
scopic data, as we shall in § 4, with them well in mind. 
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§ 3. Selection and Intensity Rules 

We return to the consideration of our system as a whole, 
without resolving it into its individual electrons, and again 
denote the total inner quantum number by j. Let A be any 
physical quantity of the system, and let it be represented by 
the Hermitian form A ; we write that portion of this form in 
which SR,- intersects $R), in the form 

£a(mm')x(m)x'(m'), (3.1) 

where the indices m, m 1 run through the values 

W = h j - 1, * • •, — j; m’ = j', j ' - 1, • • •, - f. (3.2) 

If the quantity A is a scalar, the operator A commutes with the 
operators U(s) induced in system-space by the rotations s. 
On decomposition into these irreducible sub-spaces 9f,, Sft', it 
follows from the fundamental theorem III, (10.5), of the theory 
of representations that the section (3.1) of A corresponding to 
the transition % -> SR), is zero if j' #= j and a multiple of the 
(2; -|- 1)- dimensional unit form 

2Jx(m) x'(m), 

m 

iff = j. 

An analogous situation exists for the group b 2 of rotations 
about the 2 -axis. With respect to it the total system-space 
decomposes into 1-dimensional invariant sub-spaces SR< m > in 
which the rotation with angle <f> induces the representations 
3)(™> : x(m) ~>e(— mf>) x[m). If we only assume that the physical 
quantity A possesses axial symmetry about the 2 -axis it follows 
that the coefficient a(mm') is necessarily zero when the magnetic 
quantum numbers m and m' of the initial and final states are 
different. 

We now consider a vectorial quantity q with the three 
components q x , q y , q 2 instead of the scalar quantity A. This 
is of particular importance because such a quantity, i.e. the 
electric dipole moment q of the atom, determines the interaction 
between the atom and radiation — to that approximation in 
which the linear dimensions of the atom may be neglected in 
comparison with the wave-length of the emitted light. If the 
degeneracy of the energy level E, is destroyed by an external 
axially symmetric perturbation, e.g. a homogeneous magnetic 
field in the direction of the 2 -axis, then the spectral line caused 
by the transition SR* — > 3 ft), from the term Ej to E), is broken 
up into the lines associated with all possible transitions 
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(3?,, tn)-+ (%>, m'). On calculating the part of the Hermitian 
form representing the electric dipole moment in which the sub- 
space 9t> intersects : 

Sq (mm') x(m)x' (m') , (3.3) 

the ratios of the squares |q(mm')| 2 of the absolute values of its 
coefficients determine the relative intensities of these (2 j + 1)(2;'+1) 
lines. Since q z is axially symmetric about the 2 -axis q z (mm') = 0 
unless m' = M) we thus have the selection rule 

q z : m^m (3.4) 

for the 2 -component of the electric moment. On performing 
the rotation with angle about the 2 -axis x(m), q x + iq y> q x — iq y 
are multiplied by e(—m<f>) f e(<f>), e(—$) respectively. Since 
x(m)x'(m') is therefore multiplied by e[(m ■— we obtain 

the selection rules 

4x + i<ly : m->m— 1, q x — iq y : m->m + 1 (3.4') 

for the x - and y-components of q. Only the transitions 

m -* m — 1, w, m + 1 (3.5) 

of the magnetic quantum number are allowed ; the first and the 
last generate two waves which are circularly polarized in the xy- 
plane in opposite directions , and the remaining transition m 
generates a wave which is linearly polarized in the z- direction. 
If the equation (2.3) holds for Zeeman effect, the wave number 
of the component m -> m' is displaced by an amount o(m — m') 
from its unperturbed value. Thus in 44 normal Zeeman effect ” 
we obtain instead’ of (2 j + 1)(2/ + 1) components only three, 
whose polarization is as described above and whose wave numbers 
are displaced by the amounts 0, ± o. That the resolution of 
the two terms E j} Ej>, is almost entirely hidden is due to the 
fact that the factor of proportionality ho in (2.3) has the same 
value for both terms. Fortunately most of the cases actually 
observed show 44 anomalous Zeeman effect in which the resolu- 
tion of the terms can be seen clearly ; in order to explain it 
we must change the expression (2.2) for the perturbation due 
to the magnetic field. But the above selection rule for the 
magnetic quantum number, which has been obtained from 
fundamental principles of group theory, is valid in all cases. 

The selection rule for the inner quantum number j is obtained 
in an analogous manner. The three components q X) q yi q z of q 
suffer the transformation ^ among themselves when the x(m) i 
x'(m') are subjected to the transformations corresponding to 
s in the representations 2)*, 2*/ respectively. Or, if we wish to 


SELECTION AND INTENSITY RULES 199 

express it in terms of u a instead of b 3 , 5 is that transformation 
which is associated with the element a of u 2 in the representation 
®i* This is, of course, merely an expression of the fact that q 
is a vector. Now, in accordance with the terminology intro- 
duced in III, § 14, (3.3) is a vectorial quantity in the representa- 
tion space of X and we are interested in determining 
how many linearly independent quantities of this kind there 
are. Their number is given by the number of times $ x is 
contained in X or 2), x as an irreducible constituent. 
But in accordance with (1.3) $ x occurs in % x %, exactly 
once if 

f — j — 1 or j or j -f- 1 

and otherwise not at all, and we must further exclude the case 
j — 0 , / =-■ 0 . We thus obtain the selection rule 

j^j-1, j, j + 1 (3.6) 

with the proviso that 0 -> 0 does not occur. Since there exists 
but one linearly independent vectorial quantity in the repre- 
sentation space of % X 3V in the cases in which the selection 
rule is satisfied, the components of q [m, m') are determined by 
purely group-theoretic considerations to within a constant factor 
of proportionality. 

In order to calculate the vectorial quantity (3.3) for/ = j — 1 
we proceed as follows. Let £ , tj ; £', / be two arbitrary points 
on the unit sphere which transform cogrediently under u. 
1/ + fj-q' is then the fundamental invariant, and the three 
forms which are obtained from 

Yi tie + mV (3.7) 

by multiplication with 

— £ a , (3- 8 ) 

transform in the same way as the (x + iy)- } (x — iy)-, z- com- 
ponents of a vector, respectively. They are linear in the 
monomials £ r rj* of degree k + 2 = 2j and in the monomials 
of degree k = 2 \j'. Introducing 
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as co-ordinates in the representation spaces of 2)y, 5V we find 
that the three forms above are of the type (3.3) with / = j — l. 
For example, we obtain for the ( x + fy) -component 

_ p sr t£ 

(r — 2) ! 5 ! c ^-*VV ! 5 ! 

<r-2)+*=fc 

= — ZV(j+m)(j + m- T)x(m)x'(m - 1). 

m 

In agreement with the selection rule m-+ m — 1 there occur here 
only those terms for which m f = m — 1. Calculating the 
(x — iy)- and 0 -components in the same way, we find for the 
transition 

j -*■ f = j — 1 : 

(q x + iq y ){m, m — l) = — V (j + m)(j + m — l), 

(q x — iq a )(m, m + 1) — V (/ — m){j — m — 1), (3.9) 

q z {m, m) = V (j + m)(j — m). 

In order to calculate the components for the transition j = / 
we must replace the factors (3.8) by 

which also transform like the (# + iy)-, (a?-— iy)- and 0 -com- 
ponents of a vector. Finally, for the transition / = j + 1 we 
must replace (3.8) by r /' 2 , — £' 2 , £y. Since the angular mo- 
mentum 2R is a vector, the formulae for the transition j -> j 
must naturally agree with those already obtained for 3JI [III 
(15.9)], and since q is Hermitian the formulae for the transition 
j-> j + 1 must agree with those obtained by taking the 
Hermitian conjugate of the components for the transition 

j-*] — !• 

i -*■ f = j - 

(qx + kv)( m > m — \)= v \j + m)(j - m + 1 ) , 

(q« - *$»)(*» m + l) = V(j- m)(j + m + 1), (3.9) 

q t (m, m) = m. 

j -> j 1 = j + l . 

(qx + iq y )(m, m — 1) = V{ j — m + l)(j — w + 2), 

(q x — iq y )(m, tn + l) = — V p’ + wt + 1)Q~ + w + 2) , (3.9) 
m) = V(j + m + l)(j — m + 1). 


2 )! 5! 


Vr(r - 1) 
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In each of these three sets of formulae the right-hand sides are 
determinate only to within a common factor of proportionality 
which is independent of m, but which can be completely deter- 
mined only by integrating the wave equation of the dynamic 
model of the atom, and not by the theory of groups alone. 
The coefficients which do not occur explicitly in the above 
formulae are all null. The squares of the absolute values of these 
coefficients yield the (; rational 1) intensity ratios of the components 
into which a line is split by the perturbation. 

Already before the rise of the new quantum mechanics the 
intensity formulae (3.9) for the components of a line emitted 
under the influence of a magnetic field were obtained 'from the 
observational data under the guidance of the correspondence 
principle. 5 In the new quantum mechanics they are, as we 
have seen, a consequence of the most general principles, and we 
would find ourselves in serious difficulties if they were incorrect. 
Nevertheless it is to be remembered that they can be invalid 
(1) if the spherical symmetry of the system is destroyed by 
external perturbing fields, or (2) if for short wave-lengths the 
interaction between matter and radiation is no longer determined 
primarily by the electric dipole moment. 

Since the dipole moment is a proper vector, as the components 
q X) % g° over mto on reflection i in the 

origin, the representation induced on them by U 2 has as 
signature — 1. If the signatures of 9},, are 8, 8', then under 
the influence of the reflection i (3.3) is multiplied by the factor 
88'. The coefficients q(mm') must accordingly all vanish unless 
SS' = — 1 : the selection rule for the signature is 

S-> - 8. 

If the individual electrons are governed by the scalar wave 
theory the total azimuthal quantum number L of the atom 
can jump only to L — 1, L or L +■ 1, while the sum of the azi- 
muthal quantum numbers of the individual electrons Z x + / 2 + * * * + If 
can change only by an odd integer [Laporte's rule). In the case 
of a single -electron, / = 1, only the transitions 1-+ l ± 1 are 
consistent with these rules ; this result has already been obtained 
in II, § 5, from the theory of spherical harmonics. 

The formulae (3.9) allow us to solve a problem which we shall 
here, for the sake of future application, introduce from the 
physical standpoint. A partial system in the simple state % 
is compounded with a second in the simple state to form 
a single system. In 91^/ = % X u 2 induces the representa- 
tion 2) = % x 3V ; let the corresponding moment of mo- 
mentum be 3R. On adapting the normal co-ordinate system 
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in SR,,/ to the complete reduction of 3) into its irreducible con- 
stituents 9JI is broken up into square sub-matrices 3Ji/ of 
length 2/+1, arranged along the principal diagonal, corre- 
sponding to the decomposition of 3t^' into sub-spaces 31/. But 
the same is not true of the moment of momentum 95t ; X 1 of 
the first partial system, and we wish to determine the portion 
of this matrix in which 31/ intersects itself. That is, in physical 
language, we wish to determine the temporal mean value <331, * 
of the moment of momentum of the first system in the state 
defined by the quantum numbers j, j ' ; J of the two parts and 
the whole. We assume that the interaction between the two 
parts resolves the energy level E into distinct levels Ej on 
applying the theory of perturbations. Since 30 % is a vector we 
know, from the same considerations as we applied to the electric 
dipole moment above, that the portion of it corresponding to 
the transition J J must be a multiple of $31/ : 

^33lj X 1 )>j = Kj • 301/. (3.10) 

In order to evaluate the proportionality factor k we construct 
the scalar product of the matrices (3K* X 1) and 3£ft ; since 

3R - X 1) + (1 X SR/0 
these two matrices commute and we have 


(1 x m,')* - W + (% x l) 2 - 23!R(3R, x 1) 

or 

X 1) = j(j + 1) + 1) + 3R 2 , (3.11) 

for since in the original co-ordinate system (30fy X l) 2 was 
j(j + 1) times the unit matrix, it remains the same in the new 
co-ordinates. And, on the other hand, 301(30^/ X 1) is equal 
t0 K J ’ JU + 1) times the unit matrix in the sub-space 3L, as 
follows from (3.10). Hence from (3.11) 


J{J + 1) = i(j + i) - f(j' + i) + J[J + l), 
, id + i) - f(f + 

2 /( 7 + 1 ) 


1) 


(3.12) 


§ 4. The Spinning Electron, Multiplet Structure and 
Anomalous Zeeman Effect 

We have hitherto ignored the fact that the terms of the 
alkali spectra , characterized by the two quantum numbers n, /, 
are in reality not simple. Each of these terms— with the ex- 
ception of the 5 terms l = 0— actually consists of a fine doublet. 
By § 2 the (n, l) term should be resolved into 21 -f- 1 components 
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in a magnetic field ; instead we find that one of the doublet 

terms breaks up into 21 components and the other into 21 -p 2. 

We should accordingly ascribe to them the inner quantum 


numbers j = l — 



+ g, respectively. 


Our general considerations immediately give us a hint as 
to how this discrepancy is to be explained. The quantity i p 
describing the wave field is not a scalar, but is instead a covariant 
quantity of the kind 3D*, having two components (ip 1} p 2 ). This 
is the theory of doublet phenomena as developed by W. Pauli* 
It seems indeed easy to arrive at this conclusion after the 
preparation of the preceding paragraphs, but historically this 
systematic foundation was developed only after Pauli's dis- 
covery. It is quite immaterial whether we associate the matrix 
+ 1 or the matrix — 1 with the element i in the representation 
3D* of U 2 . Taking the first of these alternatives, the signature 
has the value (— l) 1 in the quantum state ( nlj ) ; hence Laporte's 
rule remains rigorously correct on taking the spin into account. 
We have as further rigorous selection rules those concerning 
the total inner and the total magnetic quantum numbers. In 
the representation 3D* the transformation or itself corresponds 
to the element or of u 2 , and by III, (15.6), the spin moment of 

momentum is where © is the vector already defined with 


components 


0 

1 

Sy = 

0 —i 

, S z = 

1 

0 

1 

0 

) y 

i 0 


0 

-1 


We shall not as yet attempt to find the specific effect of the 
spin perturbation on the wave equation. This was done origin- 
ally by picturing the electron as a small material sphere, the 
rotation of which gave rise to the spin ; the additional moment 
of momentum required by spectroscopic observations was first 
introduced in this way by Goudsmit and Uhlenbeck. 1 Since 
S z is capable of assuming only the values i 1 it appears as if 
the spin axis can only be quantized along the positive or negative 
2 -axis ; we need not go into the false conclusions this assertion 
can lead to on interpreting it literally. The spin perturbation 
must appear in going over from classical to relativistic mechanics. 
The terms of the hydrogen atom, calculated in accordance with 
the scalar non-relativistic wave mechanics, depend only on the 
principal quantum number n, but the theory of relativity intro- 
duces a correction which causes the terms corresponding to the 
various values of l to split apart and form the so-called fine 
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structure. We should therefore expect the same scheme of 
terms in hydrogen as in the alkalies, but observation shows 
that the doublet separation of an l term into two terms with 

j = l is just such that two terms with the same /, but with 
" 1 

different / = j ± exactly coincide. Hence the spin per- 
turbation in hydrogen agrees quantitatively with the separation 
caused by the relativity correction. 

The alkali doublets show anomalous Zeeman effect. Other 
elements, such as alkaline earth metals, have (in addition to 
triplets) a system of singlet terms, and singlet terms always 
show normal Zeeman effect in a magnetic field. It therefore 
seems probable that the anomalies in Zeeman effect are closely 
connected with the spin. The magnetic separation of an alkali 
term is quite independent of the principal quantum number n ; 
all the terms of a series behave in the same way. A term (/, j) 
splits up into 2 j + 1 equi- distant components, characterized by 
the magnetic quantum number m , but their separation is hog 
instead of ho , where g is a rational function of l and j (the “ Land6 
g-factor ”). The energy value of the component m is therefore 
displaced by an amount 

hog • m (m = /, j — 1, • • — j) (4.1) 


from its unperturbed value. The empirical formula for the factor 
g, which is due to Lands, is 


g = 


2J + 1 
21+ V 


(4.2) 


This formula holds for weak magnetic fields , in which the separa- 
tion is of a smaller order of magnitude than the doublet separation. 

If l = 0, j = we have in particular g = 2. 

This latter fact gives a hint toward the solution of the puzzle : 
If the total moment of momentum consisted only of the spin 
(S = 0), its magnetic effect would be twice as great as if it con- 
sisted of 2 alone. We therefore assume that the magnetic effect 

of the spin ^ © is twice as great as that of the orbital angular mo - 

mentum 2 ; the perturbation due to an external magnetic field fi 
is therefore to be taken as 


*-£#,8 + «-£( ft * + * S ). 


(4.3) 
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The spin offers an explanation of why the beam in the Stern - 
Gerlach experiment is separated into two parts . The valence 
electron of the univalent silver atom is, in the normal state, 

in an 5 -orbit (l = 0) ; hence j = ^ and m can assume only the 
1 2 

values ± r. Although the component of the mechanical 
Z 

moment of momentum in the direction of the magnetic field 

h 

can have only the values ± the experiment shows that the 

value of the magnetic moment of the atom is a whole Bohr 
magneton, and not the half of one ; but we now see that since 
the mechanical moment of momentum consists only of spin 
it should give rise to twice the expected magnetic moment. 
The connection between magnetic moment and mechanical 
moment of momentum is even more apparent in the magneto- 
mechanical effect : the demagnetization of a vertically suspended 
bar of weak iron must result in giving to it an angular momentum. 
The ratio between the change in the magnetic moment and the 

e 

moment of momentum was expected to be but the experi- 
ment, which was performed only on ferro-magnetic bodies, 
yielded twice this value. The anomalous magnetic behaviour 
of the spin also accounts for this result, if we assume that the 
mechanical moment of momentum in ferro-magnetic substances 
is due entirely to the electron spin. 8 

Does this hypothesis also explain the general Land6 formula 
(4,2) ? This is answered by the formula (3.12) obtained toward 

the end of § 3, in which /, j', J must be taken as Z, j in order 

that it apply to the composition of electron spin and electron 
translation. We find that in the state Uj) the temporal mean 

value of the spin - © is equal to 9Jt multiplied by the factor 
Z 

,_i = l + L=.M±i) 

g 2j(j + 1) 

or 


s - 1 = ± 


Hence by (4.3) 


21+ 1 


eh 


for j — l + 


2 * 


( 4 . 4 ) 
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So long as the magnetic separation is small compared with the 
spin perturbation the Zeeman separation of the term (If) is 
determined primarily by <JF> ; (4.4) then leads, in fact, to 
equation (4.2), in agreement with the empirical data. 

If the atom consists of several, say /, electrons, the situation 
then arising can be understood with the aid of the general rule 
of composition. If the electrons are in quantum states with, 
inner quantum numbers j r and energy levels E(j r ), (r = I, 
2, ••*,/), then on neglecting the interaction between the electrons 
the total system has a (2j\ + 1) • • • (2;*/ + l)-fold energy level 
E(ji) + • • • + E(j f ). If this level coincides with none of the 
other levels it is resolved by a small perturbation into terms 
with total inner quantum numbers 7 in a manner corresponding 
to that in which the product 

®ii X X • • • X (4.5) 

is reduced into its irreducible constituents ®/ ( Clebsch-Gordan 
series). Obviously in order that this (jf) coupling lead to an 
adequate description the mutual interactions between the 
electrons must be small compared with the spin perturbation. 

The situation usually met is, however, the opposite of that 
contemplated above : the normal term order corresponds to 
the Russell-Saunders or (si) coupling. Neglecting for the moment 
the interaction between the electrons as well as the spin per- 
turbation, we are led to a 2/(2 lj + 1) ■ • • (21 f + l)-fold energy 
level (2.4) in whose characteristic space the rotation group in- 
duces the representation 

©f X (® fl X ®* X • • • X % f ). (4.6) 

Due to the interaction between the electron translations the 
second factor is reduced in a manner analogous to (4.5) ; a 
single term with azimuthal quantum number L has now the 
multiplicity 2/(2L + 1). We next reduce 

(4.7) 

and finally, as the last step, we carry out the reduction 

(J = L + s, ■£, + $•— 1, • • •, |Z/ — s\), (4.8) 

associated with the coupling between the spin and the orbital 
moment of momentum. The terms which result from this 
last reduction form together a multiplet. Each multiplet is 
therefore associated with a definite azimuthal quantum number’ 
L and a spin quantum number s ; the individual members of 
the multiplet are distinguished by the inner quantum number J . 
We call 2^+1 the multiplicity , although the number of terms 
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in the multiplet is only actually equal to this when L^s, as 
by (4.8) their number is less if L < s. The 2/-dimensional 
representation is even or odd according as / is even or odd. 
The reduction (4.7) into irreducible constituents accordingly 
yields only integral values for s when / is even and only half- 
integral values when / is odd : The term, multiplicities alternate 
regularly between . even and odd as we run through the atomic table 
in the order of increasing atomic number (H even, He odd, Li 
even, Be odd, etc : “ alternation law "). For f= 2 we have,' for 
example, 

®S = $o + Si- 
lt is empirically found that the bivalent alkaline earth metals 
have in fact a singlet and a triplet system of terms. But in the 
triplet system the 5 terms, for which L = 0, are simple ; only 
the P, D, • * •, terms have the actual multiplicity 3. 

Instead of considering all the electrons at once as in (4.6) 
we can build up the atom by successively adding one electron 
after another. On adding a next electron, say the f th , to an 
atom or an ion A a multiplet of A + characterized by azi- 
muthal quantum number L and spin 5 breaks up into all those 
multiplets contained in the representation (% X X (® £ X ®,) ( 
where l t = l is the azimuthal quantum number of the electron 
added. Since 

< 3) s x 2)* = 2+$ 4- %-i, 

X % = Z^l*, L* = L+.l, L + l-l, ■ ■ ■, \L — Z|, 
this results in multiplets (s*, L*), one for each of the pairs 

s* = s±l, L* — L + l, L + l-l, • • •, \L-l\ (4.9) 

( u branching rule ”). The alternation law is again contained in 
the first of the above equations. It is to be noted, however, 
that the Pauli exclusion principle for equivalent orbits, which 
will be discussed in part C of this chapter, materially restricts 
the array of multiplets allowed by this rule. 9 

Again applying (3.12) to the composition of spin and orbital 
moment of momentum, we find that the 2/ + 1 components 
into which a J term of a multiplet (s, L) is split in a weak magnetic 
field are displaced from the unperturbed positions by the amounts 

hog -m (m = J, J — 1, • • •, — J) (4.10) 

where the separation factor g is given by 

, . ./(./ + 1) — L(L + 1) + s(s + 1) 
g_1+ 27(7 + 1) 


(4.11) 
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This is exactly the formula which was derived empirically by 
Landi ; we here see the importance of the fact that the square 
of the absolute value of the moment of momentum 3J£ (or £ or (§) 
is calculated from the quantum number J (or L or s) by J(J + I), 
etc., instead of / 2 , etc., as in the older quantum mechanics. 

When the magnetic field increases to such an extent that the 
magnetic separation becomes comparable with the separation 
between the terms of the multiplet we must handle both the 
perturbation to which the multiplet separation is due and the 
magnetic perturbation together. In order to express the small; 
ness of the term in the Hamiltonian function to which this 
former perturbation is due, we introduce a factor p which will 
appear in the same way as the factor o in the magnetic term ; 
the case of a weak magnetic field may then be expressed by 
saying that o is small in comparison with p. We can consider 
o and p as variables which increase gradually from 0 to their 
actual values and follow the dependence of the separation on 
their ratio. We therefore write the perturbation term in the 
Hamiltonian function in the form 

W = P W' + oW". 

Since the decomposition (4.8) need not for present purposes 
be expressed in terms of its ultimate constituents, the individual 
electrons, we may here denote the azimuthal and inner quantum 
numbers by l and j. Let the representation spaces of % x 
be t*, 9^ with co-ordinates tj(m 8 ), x(m x ) respectively. Denote 
the moments of momentum 2K*, of these two representations 
by £, £ respectively ; if the magnetic field has as its direction 
the 2 -axis, then 

W" = h(L z + is,). (4.12) 

The co-ordinate system is again to be so chosen that the rotations 
about the 2 -axis appear in reduced form ; to such a rotation 
of angle <f> corresponds the transformation 

g(m 8 ) -* e(— m 8 <f>) • f(m*), x(m l ) e(— m$) • #(m,) ; 

the range of the quantum numbers m s and m x is given by 

m 8 = s, ^ - 1, • • - 5 ; mi = /, l _ 1, . . —i (4.13) 

The variables of r* X % then behave like the (2$ + 1)(2Z + 1) 
products 

£(m 8 ) ‘^(wj) (4.14) 

and are multiplied, under the influence of a rotation <f> about 
the 2 - axis, by where 

m = nt s -f - nt x . 
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We now reduce X into its irreducible constituents 
Let the co-ordinates of the (2j .+ 1) -dimensional irreducible sub- 
space of t s X 5ftj, in which the representation 3), takes place, 
be denoted by 

x(j ; m) (m = j, j- 1, • • •, - ;). 

w is the magnetic quantum number, i.e. under the influence of 
the rotation about the 0 -axis x(j ; m) is multiplied by e ( — m<f>). 
The co-ordinate transformation which leads to the complete 
reduction of X into its constituents %j is obviously of 
such a kind that x(j ; m) is a linear combination of those of the 
variables (4.14) for which m s + m x has the value ra. 

If the unperturbed system possesses no accidental degenera- 
tion the separation is determined by that part of the matrix 
(4.12) in which the sub-space t„ X % of ffi intersects itself. 
We must therefore solve a secular equation G of degree 
(25 + 1)(2 1 + 1) ; but the problem is materially simplified by 
the fact that the perturbation term possesses rotational symmetry 
stbout the 0 -axis, as the only non-vanishing elements of the 
matrix W are those for which w m. The one secular equation 
G is consequently broken up into 2(1 + s) + 1 secular equations 
G m corresponding to the possible values 

m = Z + 5, / + 5 — 1, • * •, — (/ + s) 

of m . The degree of G m is given by the number of possible 
partitions of m into two summands m a + m t which run through 
the ranges (4.13). In the case of a single electron, /= 1, we 
liave only equations of the first and second degrees, and the 
calculation can therefore be carried through completely for this 
case. 10 

The roots of the secular equation G m are the displacements 
of the energy terms due to the perturbation. Since the trace 
of a matrix is an invariant, the sum of the term displacements 
"which are associated with a definite value m of the magnetic 
quantum number (the roots of the secular equation G rn ) is equal 
■to the sum of the terms in the principal diagonal of this portion 
of W y i.e. to 

Z W(m s m h m s mi). 

(m g + mi *» m) 


it is therefore a homogeneous linear function of p and o (“ sum 
rule ”). We obtain the part due to the magnetic field by putting 
p = 0 ; by (4.12) this is 

oW”(m*m h m A m x ) = ho(mi + 2m ,). 


14 
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On the other hand, the formulae (4.10), (4.11) determine the 
term displacements in the case in which o is small in comparison 
with p. In consequence of the sum rule these two results must 
agree. I and s being fixed once and for all, we denote the Land£ 
g-f actor (4.11) by g(j ), and we then have 

2(m x + 2 m s ) = m • 2g(j). 

The sum on the left is extended over all partitions of 

for given m, and that on the right over all values of j which 

are consistent with the conditions 

j = M> M + 1> • • • ; i = 1 + s > 1 + s — 1> • • *, I* — 4 

g(j) can in fact be determined from this equation. For m=l+s 
both sums reduce to a. jingle term ; we then have 

1+ 2s= {1 + s)- g(l + s). 

For m — l + s — 1 there are two possibilities for (m 8) m x ) and 
two for j : mi — l, m s = s — 1 or = l — 1, m s — s ; j — / + s 
or l + s — 1. Consequently we must have 

21 + 4s - 3 = (/ + j - 1 ){g(l + s )+g(l + s - 1)}. 

In this way we obtain recursion formulae for the successive 
calculation of g(l + 5), g(l + s — 1), • • \ The reader can 
readily verify that the result of the first few steps agrees with 
(4.11). 

It is to be noted that in following the terms from a weak 
to a strong magnetic field they cannot cross each other, con- 
sidered as functions of the monotonic increasing parameter 
o:p; the “singular elements ” of a unitary group, i.e. those 
elements for which two or more characteristic values coincide, 
constitute a manifold of three , and not simply one , fewer 
dimensions. 11 


B. The Lorentz Group 

§ 5. Relativistically Invariant Equations of Motion of 

an Electron 

We have as yet obtained no specific expression for the spin 
perturbation ; that for the magnetic effect due to an external 
field was set up with the aid of the experimental facts. It is 
clear that we can arrive at a satisfactory theory of the electron 
only when we are able to express its fundamental laws of motion 
in a form which is invariant under Lorentz transformations, as 
required by the restricted theory of relativity. The solution of 
this problem is due to Dirac. 11 We saw in III, § 8, how the 
2-dimensional representation of the rotation group, which, 
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following Pauli , characterizes the covariant quantity ip = (ip X) t p 2 ) 
describing the wave field, can be extended to the group of 
positive Lorentz transformations, ip lt ip 2 play the same role as 
the variables f, rj introduced in connection with 

Following de Broglie we took as the wave equation of a 
particle of mass m in field-free space 



1V_ 
c 2 IP 


> 



But this equation is not in agreement with the general scheme 
of quantum mechanics, which requires that only first order 
derivatives with respect to the time appear. The formulation 
of a relativistically invariant differential equation satisfying 
this requirement is, as Dirac discovered, made possible by the 
transition from the scalar wave function ip to one with two 
components. We seek to derive these dynamical equations 
from a Hamiltonian principle. 

Let 

x 0 = ct, x x = x, x 2 = y, x 3 = z 


constitute a normal co-ordinate system in our 4-dimensional 
space-time. If the quantity co is of the same kind as ip, the 

quantities ipS^co behave, in accordance with III, (8.16), like the 
four components of a 4-vector ; the S a are the matrices defined 
in III, (8.15). Hence in particular 


3 


its* 

p = o 



are the components ds * of an infinitesimal vector ; we are here 
dealing with a linear correspondence which is independent of 
the co-ordinate system employed and which sends the vector 
dx over into ds . Its trace 



(5.2) 


is consequently a scalar and its integral (multiplied by 1 ji) 

M = -H £x(i S x ~- ■ dx ( dx — dx 0 dx t dx t dx 3 ) , (5.3) 

^ J ot vX# 

extended over any finite portion of the world, is a quantity which 
is independent of the co-ordinate system.* 


* The letter M used for the material part of the action is not to be confused 
with the moment of momentum. 
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Although M may not be real, it is practically real in the sense 
that M — M is the integral of a complete divergence. For 
since the S a are Hermitian matrices, 



and M — M is in fact the integral of 

1 

ta ' 

In using M as an action we are not interested in M itself, but 
only in its variations 8 M caused by arbitrary infinitesimal 
variations Sip of tft = *p 2 ) which vanish outside of a given 

finite portion of the world (the integral is then extended over 
the entire world or, what amounts to the same, over this finite 
portion). The circumstances mentioned above guarantee that 
8 M is real ; on writing it in the form 

8 M = | (8 ijt • a) + • 8iff)dx 

we find on comparison with (5.3) that 



We thus arrive at the first order differential operator 

V - ZS.±. (5.4) 

From the invariance of (5.2) it follows that this operator trans- 
forms ^ >(i t ) into a quantity iji' = (</^, </4) which trans- 

forms contragrediently to $= (fa, fa) under the influence of 
an arbitrary positive Lorentz transformation. If we wish to 
guarantee that M is real, we may replace the original definition 
by 



In III, § 8, we found it necessary to introduce quantities 
fa, fa z which transform contragrediently to fa, fa in order to 
be able to extend the restricted Lorentz group to the complete 
group. And just as V applied to ifi generates a quantity of the 
kind fa, in the same way the “ conjugate ” operator 

a 
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transforms f into a quantity of the kind f V'V is, as is readily 
verified, the operator 


<>*o 



Consequently equation (5.1) for can be written in the form 


= 0, 

iv V' + iff = 0 

* J 


(5.6) 


on introducing an auxiliary pair of components From now 
on we denote the column of the four components if, X) ^ ; if/ v ft 
by ip and employ S a as the symbol for the transformations of 
these four components as in the latter part of Chapter III; 
with this understanding the differential equations (5.6) arise 
from an action integral which is composed additively of the 
quantity M, (5.3), and the invariant [cf. Ill, (8.19)] 


M f = m 0 j*^7Y « dx. 


M and M' are also invariant with respect to interchange of 
right and left, and under the spatial reflection i in the origin. 

In accordance with the general scheme of quantum mechanics 
the differential equations for ip should, as already remarked, 
contain only the first derivative of ijj with respect to time ; the 
additional requirement that it be relativistically invariant then 
leads to the conclusion that it can also contain only first de- 
rivatives with respect to the spatial co-ordinates. We have 
here been able to satisfy these requirements without altering 
the actual content of de Broglie’s equation (for the components 
^ i > * 1 * 2 ) J the equations thus obtained are to be taken as the 
equations for a free particle. This formal transition to first 
order equations will become physically significant only when 
we pass to the derivation of the equations of motion in an electro- 
magnetic field with the aid of the principle of gauge invariance 
developed in II, § 12. According to it, if — <f> 0 is the scalar and 
$2) ^3 the vector potential, we must replace 
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It will be found convenient in the following to introduce the 
quantities f„ obtained by multiplying the potentials <f> x by the 

factor 7 -. Then in 
he 

M = • dx (5.8) 


the operator V is defined by 




(5.9) 


Because of this gauge invariance the quantities M, M' are 
unchanged on replacing simultaneously 

if> by e iy ifj and f„ by f x — (5.10) 


where A is an arbitrary function of position in space-time. Now 
take A to be an infinitesimal function which vanishes outside 
a certain finite portion of the world ; then 8M and 8M r must 
automatically vanish for the variations 

8 if) = iX ■ if), 8 /. = - (5.11) 

The complete expression 

8{M + M') = J [(8 £ • co + w ■ Bifr) + 2fs° 8 f„)dx 

for the variation automatically tells us that under the assumption 
that the laws of matter ( 5 . 6 ) are satisfied, i.e. that o> = 0 , 


8{M + AT) = f Zs«8f tt • dx. 

J <% 

Hence we have as a consequence of the laws of matter 



i.e. the continuity equation 



A glance at the explicit expression for M shows that 

s* = if ) ; 


(5.12) 

(5.13) 


these are the quantities which formed the starting-point for 
the theory of the transformations of if) as developed in III, § 8 , 
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and we already know that they form the components of a 
4-vector which is independent of the particular space-time 
co-ordinates employed. The time component 

s° = ifsifj = ( + ^2^2) + + ^2^2) (5.14) 

is the probability density and hence cg — cis 1 , s 2 , s z ) is what 
may be called the probability current : in order to obtain the 
number of particles which will on the average pass through 
a surface element do in time unit, multiply the total number of 
particles present into the* product of the area do and the normal 
component of the vector eg. On integrating the equation (5.12) 
over a volume V we find that the increase in the mean number 
of particles in V per unit time is equal to the mean number of 
particles entering V through the surface in unit time. In 
contrast to the provisional scalar theory , the Dirac theory leads in 
a most natural way to expressions for the probability density , as 
well as the probability current , which depend on ifj alone . 

On integrating 

j s°dx t dx 2 dx 3 

over the whole of space we find that the integral is independent 
of time — and, in accordance with the statistical interpretation 
of iff, is to be so normalized that its value is 1. Consequently, 
in the dynamical law 

the energy Hjh is a Hermitian operator, as should be. We 
shall from now on take h as the unit of action , with corresponding 
units for linear and angular momentum . The result of this is 
that the quantity h disappears completely from the laws of 

quantum mechanics. With the usual abbreviation, p M = 

\h = /, + i s r (p r + fr) + m, T. (5.1 5) 

c rsa ] 

The influence of the electro-magnetic field on the matter is 
taken care of by (5.9), but, on the other hand, the matter gener- 
ates the electro-magnetic field in accordance with Maxwell’s 
equations. In order to express this explicitly we must add to 
M + M' the Maxwellian action 

F = 2 J {(/!, + fl + fu) - (/,*< 0 + fl 0 + /&)}<** (5-16) 



216 APPLICATIONS OF GROUP THEORY 
of the electro-magnetic field, where the 

f _ Ve 
Jmf *x a ~ 7>x e 

are the field strengths — which are unaffected by the change of 
gauge (5.10). F is obtained from 

yj(§ 2 - &)dVdt (5.17) 

( £ \ 2 g2 

he) ” ch? * s t ^ le act ion in 

Heaviside units, which are best adapted to the electro-magnetic 
field theory. Since we have taken h as the unit of action, the 
total action of our system, consisting of matter plus field, is 

W = M + M' + -F (« = £). (5.18) 

For reasons which will be apparent later the real number a/4w 
is called the fine structure constant. Whereas the variation 
of the in the Hamiltonian integral JJ V • dx yields the equations 
of matter , variation of /„ leads to the equations of the electro- 
magnetic field with 

- e- 5* = - e^S a tf, (5.19) 

appearing as the 4-vector of charge and current density. The 
only constants occurring in the field equations are the two 
combinations 


cm e * 

m, = T' a = 5 t'- 20 ) 

of fundamental atomic constants ; the first is a reciprocal 
length and the second a pure number. 

Schrodinger. in his fundamental papers on wave mechanics, 
thought he could explain the quantum behaviour of matter 
and radiation classically ” by setting up a closed system of 
field equations such as we have obtained above. In particular, 
he held that the charge of the electron was actually 44 smeared ” 
over the whole of space with the density - e-s*. But there can 
be no doubt at the present time that the field equations are not 
to be interpreted m this classical manner ; they must rather 
be interpreted in accordance with the statistical view-point 
developed m Chapter II. The expression (5.14) for the density 
tnen guarantees the atomistic structure of electricity . To show 
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this we first remark that the charge in a volume V is represented 
by — e times the Hermitian form 

j \ \^dx x dx z dx z . 

( v ) 

But this is an kt idempotent ” form with respect to the “ vector ” 
; its characteristic values are 1, 0 and the corresponding 
characteristic functions are those quantities i ]j which vanish 
outside or inside V , respectively. The charge contained in V 
is accordingly capable of assuming only the values — e and 0, 
i.e. according to whether the electron is found in V or not. In 
order to guarantee the atomicity of electricity the electric 
charge density must equal — e times the probability density. 
But if we base our theory on the de Broglie wave equation, 
modified by introducing the electro-magnetic potentials in 
accordance with the rule (5.7), we find as the expression for the 

charge density one involving the temporal derivative — in 

addition to ^ ; this expression has nothing to do with the prob- 
ability density and is not even an idempotent form. According 
to Dirac this is the most conclusive argument for the stand 
that the differential equations for the motion of an electron in 
an electro-magnetic field must contain only first order derivatives 
with respect to the time. 13 Since it is not possible to obtain 
such an equation with a scalar wave function wdiich satisfies at 
the same time the requirement of relativistic invariance, the 
spin appears as a phenomenon necessitated by the theory of 
relativity. 

The theorem of the conservation of electricity (5.12) follows, 
as we have seen, from the equations of matter, but it is at the 
same time a consequence of the electro-magnetic equations. 
The fact that (5.12) is a consequence of both sets of field laws 
means that these sets are not independent, i.e. that there exists 
an identity between them. The true ground for this identity 
is to be found in the gauge invariance, for it is equivalent to 
the assertion that 8W vanishes identically when $ and /* are 
subjected to variations of the form (5.11). We have 

SW = J {(8 • w + S • ty) + £L« 8f x }dx, 

where to = 0 are the equations of matter and L* = 0 the 
Maxwellian equations. On substituting the variations from 
(5.11) and integrating the last term in the integral by parts, 

* a 
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Because of the arbitrariness of the gauge the number of inde- 
pendent equations must be one less than the number of unknown 
functions <}> and f„. 


§ 6. Energy and Momentum. Remarks on the 
Interchange of Past and Future 


I. Energy and Momentum. 


The complete field equations are explicitly 

f‘ Sa (ri +/ “V + ,Mo ' :r,/,=0: 

div @ + p = 0, — curl § = 3. 

’ Zx 0 J 


( 6 . 1 ) 


Where 6 and § are the electric and magnetic field strengths : 

E — .... H = .... /c g\ 

p is the charge density ijnp, and the components s lt • • • of the 
current 6 are given by 

Si = $ **\ (6.3) 

In addition to the differential law 


div 3 = 0, (6.4) 

expressing the conservation of electricity , we have a vector con- 
servation law governing energy and momentum. A completely 
satisfactory expression for the tensor representing density and 
flux of energy and momentum is only to be obtained along the 
lmes employed in the general theory of relativity. Here we 
give only the result for the density of energy — c * t° 0 and mo- 
mentum (£j, £j}), and in doing so we separate the material 

from the electro-magnetic part. We have for the part referring 
to matter 


- < - s£{^(s5 + - (£ - *)? • V} 

+ « 0 ?7’*;f (6.5) 

*»= ... 

2i\ Zx 1 Zx?) ^ 4 Vax 2 ZxJ’ 

U e _ h i aV o h <T “^ oduced > in Edition to S» the operator SI 
W — LA a; which acts on all four components of ifi ; whereas 
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the former subjects ip 1} to the 2-dimensional transformation 
S p [III, (8.15)] and ip' l} ip' 2 to — S p , the latter exercises the 
same 2-dimensional transformation S p on both pairs of com- 
ponents. Correspondingly 

s' p — $Spift. 


The density of energy and momentum due to the electro-magnetic 
field is given by the familiar Maxwellian expressions 

- 4 + +) + (»! ++»;1 (65) 

4 = «(£A - e,h„, • ■ 

We find the conservation laws 


f 25. 

«-o 


0 ; 


jrK 

a«0 


= o, 


( 6 . 7 ) 


as consequences of the field equations. Furthermore, the tensor 
t is symmetric — not identically, but in consequence of the field 
equations ; in this sense we have 

t$ + t° P = 0(p = 1,2,3); tl = t*(p,q= 1,2,3). (6.8) 


On combining these with (6.7) we obtain the divergence con- 
ditions 


t <K*2 *3 — *3 j g) 
<*■“0 


= 0 , 


( 6 . 9 ) 


y Mg* *1 + *1 to) = Q 

OfassO 


( 6 . 10 ) 


These results can all be verified directly, but their deeper 
significance can be understood only by going over to the general 
theory of relativity as mentioned above. Just as the theorem 
of the conservation of electricity follows from the gauge in- 
variance of the equations, the theorems for the conservation 
of energy and momentum follow from the circumstance that 
the action integral, formulated as in the general theory of 
relativity, is invariant under arbitrary (infinitesimal) transforma- 
tions of co-ordinates. In this general relativistic formulation we 
need further to erect a normal set of co-ordinate axes at each 
point P of space-time, consisting of four mutually perpendicular 
directions at P (“ orthogonal ennuple ”), in order to fix the 
metric at P and to be able to describe the wave quantity ip in 
terms of its components ; all permissible orthogonal ennuples 
at P are obtainable from each other by local Lorentz transforma- 
tions which leave P invariant. But the rotations of these local 
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ennuples can be performed in the various points P quite inde- 
pendently — the quantities at various points are not bound to 
each other as in the special theory of relativity. The symmetry 
of the energy-momentum tensor can be traced back to the 
invariance with respect to such rotations. One can in fact 
take it as a general rule that every invariance property of the 
kind met in general relativity, involving an arbitrary function, 
gives rise to a differential conservation theorem. In particular, 
gauge invariance is only to be understood from this standpoint. 
It follows from the transformation laws for ip that its four com- 
ponents fa relative to the local ennuple are determined only to 
within a common factor e ix of proportionality, the exponent A 
of which depends arbitrarily on position in , space-time ; in 
consequence of this it is necessary, in order to obtain a unique 
covariant differential for ip, to set up a linear form EfadXc, which 

at 

is coupled with the gauge factor contained in ip in the manner 
required by the principle of gauge invariance . 14 

We obtain the integral conservation laws from the differential 
ones by integration. We set up the integral 

$iyV = J„ (dV = dx 1 dx t dx 3 ) 


over a section x 0 = const, of space-time and find that it is 
independent of x 0 . — C J 0 = H is the energy and (J u J 3 ) 

the linear momentum. The material part is, on a simple in- 
tegration by parts, ? 


70 = 

71 = 


-r * 


?J’ ( Cik + , ’) + m ° T V iV ’ 


i 


dV, 


These are Hermitian forms in the “ vector ” f They again lead 
us to associate the operators A) with the components 

linear momentum, i.e. to the assumptions with 
which we, following de Broglie and Schrddinger ) began. For the 
energy we obtain (on dividing by c) the operator 

; H ir r +/ 0 + m ° T ’ 

" (6 - 16) •' the diff ““ tial 


<1 ~b 


, A, . 


1 
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Moreover, we must not forget that to the part due to matter 
we must yet add that due to the electro-magnetic field. 

The quantities 

M i = \M- x 3 it)dV, • • (6.11) 

which are by (6.9) also constant, are the components of the 
moment of momentum. We find from (6.5) that the part due 
to matter is 



In agreement with our earlier assumptions we here obtain the 
operator which is composed of the sum of the ^-component 

~\ x i~^Z x 3 err ) °f orbital moment of momentum and the 

t\ 3 oX 2 / 

spin moment of momentum i S[. The vector 

i< S’ = 5 (Si, Si, Si) 

is actually the spin, for in accordance with the law of trans- 
formation of both iff pairs ^ 2 ), (0' f of components suffer 
the same transformation cr as in the Pauli theory of the spin 
under the influence of the transformation a (spatial rotation) 
of u 2 . 

On integrating equations (6.10) over the spatial section 
x 0 = const, we obtain 

which we may consider as the law of inertia of energy. The 

H 

integral may be written J 0 • f , = — — • g l9 where f 2 , £ 3 are 

the co-ordinates of the “ centre of energy ; the equations are 
then 

r -!L 

Jl ~~ r 2 * dV 

We thus obtain the familiar mechanical law : Momentum is 
equal to mass times velocity , where the velocity is to be taken as 
that of the centre of energy and the mass as 1/r 2 times the energy 
content of the field. Nevertheless it is advisable not to divide 
by H in defining the centre of energy, as the energy density 
— is here no longer positive-definite, and we cannot be certain 
that the energy content H will turn out to be positive. 
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Our theory is a classical field theory, the quantum features 
entering only in the statistical interpretation. With this 
interpretation the field laws are concerned with a single electron . 
At the present stage of our development we can deal only with 
the additional quantities due to the electro-magnetic field by 
assuming a given external field affecting the motion of the 
particle, without the particle reacting on the field ; we must 
then surrender our Maxwellian equations. The true laws 
governing the interaction between electrons and quanta will 
only be obtained, in analogy with II, § 13, on subjecting the 
system of field equations to the process of quantization, just 
as was done by Heisenberg for any system of classical mechanical 
differential equations. 

The fact that we are led back to our original assumptions 
concerning the operators representing position and momentum 
is due to the particular expressions we have chosen for the 
action, from which the field equations were obtained ; indeed, 
it depends entirely on the part M. These original postulates 
of quantum theory are accordingly of less interest from the 
standpoint of general principles than we at first believed. But, 
on the other hand, this connection seems to indicate that M 
cannot be replaced in its role as representing the action due 
to matter. M is also responsible for the fact that the charge 
and probability densities agree, which is unconditionally re- 
quired as a guarantee of the atomistic structure of electric 
charge. These connections with the most fundamental physical 
observations thus require that the action be composed additively 
of M and further terms which are invariant not only under 
change of gauge (5. 10) as is M, but also on replacing if/ by e ix • if/ and 

f« by f» — where A and fi are two independent arbitrary 

functions in space-time. M' and the Maxwellian action F are 
in fact of this kind. Further relativistic invariant scalars 
satisfying these conditions are readily found — indeed it is not 
difficult to set up the most, general action possible with the 
quantities at our disposal. But we have yet to be convinced 
by physical observation that the three quantities Af, M\ F 
here employed do not suffice. 

II. Electric and Magnetic Spin Perturbations . 

In order to be able to compare Dirac’s theory with the facts, 
we eliminate if/[ } ifr' 2 in the same way as we did in the absence 
of the electro-magnetic field. We obtain the equation 

— VVA = 
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with the new definition (5.9) of V and V'. The substitutions 
S a in two variables satisfied the equations 

SqSi = SiSq = ; ^ 2^3 = — S^2 = iS i ] 

and consequently those denoted by the same letters but operating 
on all four variables obey 


5 0 S 1 = S X S 0 = S x ; 5A - - 5 3 S 2 - iS[. 


V'V contains terms of the following four types : 

(1) ( w 0 + ifo )( it 0 +ifo )’ 

(2) ~(^i +ifi )(£i +if 'Y 

(3) 5l {(^ 0 + ifo ) (w t + ifl ) ~ (w, + ik ) (w c + z/o )}’ 

(4) - is , 1 {{ w t + ' ik ){^ 3 + ifa )- (4 + i/s )(i + t/4 )}‘ 

We collect together terms of types (1) and (2) to form the 
“ regular term ” in which the components of ift are not coupled 
with each other : 






[The transition from lower to upper indices, i.e. from “ co- 
variant ” to “ contravariant ” components, is performed in 
accordance with the equations ft == — / 0 , ft = f p (p = 1, 2, 3).] 
The irregular term consists of the electric part 



^) + + = + +) 


and the magnetic part 

These become, on multiplying by the factor h and expressing 
the electric and magnetic field strengths 6 and § in the usual 
units, 

i c m , ;(©•©. 

We have already (II, § 12) calculated the regular term for a 
homogeneous magnetic field and found it to be - (§£). On adding 
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the regular and irregular terms we obtain, on neglecting the 
squares j\ of the potentials, 

This contains the fact, which was already derived in § 4 from 

. 1 

spectroscopic data, that to the spin ^ , twice as great a magnetic 

moment is to be ascribed as to the same amount of orbital 
moment of momentum ; we have now obtained a convincing 
theoretical foundation for this procedure. The laws governing 
the interaction of a general inhomogeneous magnetic field with 
orbital and spin momenta emphasize still more emphatically 
the essential difference between 8 and ©*. The irregular electric 
term, calculated for the central-symmetric field originating in 
the nucleus, is the spin perturbation. 

The description of the electron given earlier, according to 
which it was a composite structure composed of two kine- 
matically independent parts — the electron translation, with an 
oo -dimensional system-space, and the electron spin, with a 
2-dimensional system space — is, in view of the Dirac theory, 
no longer quite appropriate. But the classification of spectra 
given there is none the less valid here, for it depends only on 
the fact that to the group of rotations of physical space corre- 
sponds the representation 3)* X © in the total system-space. 

From the field equations (6.1) as they are to be understood 
for the present, i.e. as the laws of motion of an electron in an 
external electro-magnetic field, dispersion pheyiomena can be 
(approximately) calculated ; they tell us how the motion of the 
electron in the normal or other quantum states is affected by 
the incident light wave. From the perturbed *ft we then deter- 
mine the scattered light with the aid of Maxwell’s equations ; 
to this class of phenomena belong in particular the Compton 
and Smekal-Raman effects , 15 Spontaneous emission can be 
handled similarly if we take the considerations of II, § 13 , as 
justifying the following procedure: The polarization and 
intensity of light emitted by the quantum jump n n' of the 
atom is to be calculated by integrating Maxwell’s equations, 
where the expressions i/jiff, for charge and current density 
are to be understood as ^(n)@^(n o 0(*> being the 

characteristic function of the atom in the n ^ quantum state. 
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III. Interchange of Past and Future . 

The action is so constructed, that it is invariant under inter- 
change of right and left ; the corresponding substitution is 

*; 2 ? 72 - 1 ' 2 . 3 > 

Jo Jo, Jp Jp j ' 

’A ^2 -> </4 ; 'P'i -> <Aj, <A 

Does a corresponding result hold for the interchange of past 
and future? The foundations of the theory lead to the hope 
that it will be able to take account of the essential difference 
between the two time directions, so obvious in Nature. But 
Dirac has remarked that M, M' go over into — M, — M f under 
the influence of the substitution 

—*■ foe -> foe ( a == I jgj 

01 01» 02 7* 02 J 01 — 01, 02 — 02-/ 



Hence when, in dealing with the motion of an electron in an 
external electro-magnetic field, we obtain a solution if/ which 
contains the time in the factor e~ ivt } this substitution will lead 
us to a new solution which contains the time in the factor e ivt ; or, 
more precisely, a solution of the problem obtained by changing 
/ into — /. But this can be done by retaining the same external 
field with potentials <f> and replacing e by — e. We denote such 
a particle, whose mass is the same as that of the electron but 
whose charge is e instead of - as a “ positive electron ” ; it 
is not observed in Nature ! It follows from what has been said 
above that the energy levels of such a particle are — hv, where 
hv are those of the negative electron. Disregarding this differ- 
ence in sign, the two particles behave the same. The electron 
will possess , in addition to its positive energy levels, negative ones 
as well , the latter arising from the positive energy levels of the 
positive electron on changing signs* as above. Obviously some- 
thing is wrong here ; we should be able to get rid of these negative 
energy levels of the electron. But that seems impossible, for 
under the influence of the radiation field transitions should occur 
between the positive and negative terms. That we have twice 
as many terms as we should is obviously related to the fact 
that our quantity if/ has four instead of two components (satisfying 
first order differential equations). The solution of this dif- 
ficulty would seem to lie in the direction of interpreting our 
four differential equations as including the proton in addition 
to the electron, 
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The substitution (6.13) transforms the terms Af, AT of the 
action into — M , — AT, but leaves the Maxwellian term F 
unaltered. Our field equations as a whole, i.e. when we also 
take into account the reaction of the particle on the radiation 
field, are consequently not invariant under this substitution* 
However, there does exist a substitution which reverses the 
direction of time and which at the same time leaves all terms in 
the action invariant. We mentioned in III, § 8 that the ex- 
pression (5.13) formed from a 0 with two components takes on 
the sign 8 a : 8 0 = 1, 8* = — l(p = 1, 2, 3) on going over from 
0i, 0 2 to 0 2 , — Hence if cu is a quantity which transforms in 
the same way as 0 then 

0 S* to -> 8* • a> S* 0 ; 


on applying this to to = 


ZV 1 we find ^at 

i'dxp p 




Hence if we make in addition the substitution 


then 


*o -* — *(>, (#>=1,2,3) 




0 


and consequently M, formula (5.5), remains invariant. In the 
presence of an electro-magnetic field its components must 
change signs in accordance with 

/o “*■/<>» fp -*■ — ft iP — 1, 2, 3). 

We have thus found that M, M’ and F all remain invariant 
under the substitution 


*o 

/»■ 


:n^>=i, 2 , 3, I 
* ™ i ; 0i -*■ Pi, -* - 0 J 


(6.14) 


This shows that the past and the future enter into our field 
theory in precisely the same manner— in spite of the fact that 
the sign in the exponent of the time factor e~"‘ of a solution of 
the quantum problem is unchanged by the substitution (6.14). 

of , cour ^ su spend judgment as to whether the laws 
governing interaction between photons and electrons allow us 
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to distinguish between these two directions in time until we 
have carried through the quantization (§ 12 ). 

§ 7. Electron in Spherically Symmetric Field 

We now proceed to the discussion of the behaviour of an 
electron in a spherically symmetric electrostatic field in Dirac's 
theory. 

/. Dirac's Conservation Theorem . 

From the definitions follow immediately the commutation 
rules : 

S P T = - TS» S V T - TS ; (p = 1, 2, 3). 

We need further the results 

SA = 1 } ^2^3 5=1 — S2S3 = iSi 
and the commutation rules 

L\ pi — Pi ^ 0* (^S) ^ Pi L'l T" P 2 -^2 Pz J-'z ~ 0* 

T\p2 ~~ P 2 Al = ipz , pi pi L 2 = ip& 

for the components of linear and angular momenta p = (p l5 j> 2} £ 3 ) 
and fi == (A, L 2 , L 3 ). 

In a spherically symmetric electrostatic field A = A = A = 0 
an( j[ / 0 = # is a function only of the distance r from the centre. 
With the aid of the formulae given above it is easily shown that 

M l = L 1 +^S’ 1 

commutes with <t>, T, (@’|>) and consequently wfth each term in 
the expression 

lH = <P + (®p) + m 0 T (7.1) 

for the energy H, Indeed, this conservation law for the total 

moment of momentum 8 JI = S + ^ @* was already known to 

us from general considerations. We further find that (@ S) 
commutes with 0 and T, but that 

or 

(@’p){(@’S) + 1 } + {(S’S) + !}(©’*») = °- 

Hence (©’£) + 1 anti-commutes with (<B'p) and therefore also 
with (©p) ; its commutation properties with respect to the three 
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terms of (7.1) are therefore the same as those of T. 
setting 


(©’£) + 1 = kT, 


Hence on 
(7.2) 


k is a scalar which commutes with the energy H (where by scalar 
we mean invariant under the group of rotations of space). 
Consequently we can decompose the system-space of the electron 
into irreducible sub-spaces associated with the rotation 
group, in such a way that the quantity k, which we call the 
auxiliary quantum number , as well as the energy H, 
possesses a definite value in each of the sub-spaces. Now 


(< 5 ’ 2) 2 = {Lt++} + {S’ 2 S;(L 2 L 3 - L 3 L t ) + + } 

= 2 2 - (S\L 1 + +) = £*_ (©’£) 

and consequently 

{(©’2) + l } 2 = 2 2 + (©’2) + 1 = (2 + |©’) 2 + j = W + 1 

W = k*- i 

4 

This agrees with 

w=j(j+i)=(j + \y ( 7 . 3 ) 

when we put 

; = 1*1-5, 1*1 -y+ 5. (7.4) 

Accordingly , the auxiliary quantum number k is a non-vanishing 
integer. The conservation theorem (7.2) goes beyond (7.3) in 
giving us in addition the sign of k. For a given half-integral j 

the two values k = ± (^j + are both possible ; they must 

correspond to the two possibilities l = j ± \ of our previous 
notation. The single quantum number k replaces the two l, j. 


II. The Differential Equation for the Determination of the 
Characteristic Values. 

Since the field is spherically symmetric, it suffices to carry 
through the calculation for the point x = 0, y = 0, z = r. At 
this point 
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and the Dirac conservation law (7.2) becomes 



together with the equations obtained from these by interchanging 
the two pairs *ft 1} \fj 2 and if/[, of components. The differential 
equation (6.1) for the characteristic vector */r, which contains 
the time only in the factor e~ ivt , has as its four components the 
two 



and two others of analogous structure ; we have here written 

E = E — 0 — U. 

c’ 

The derivatives with respect to # and y which appear in (7.6) 
can be eliminated with the aid of (7.5) ; the resulting equations 
are 

[ D -'t? + 3?) ]* + (-“• + tV- 0 

where 

7=^(0, 0, r), g = 0, r). 

The remaining two equations are obtained by writing (i/4, 0 2 ) 
in place of (j/q, i/q). At an arbitrary point P == P(x , z) the 
first and third components of i/j satisfy the equations (7.7) in 
a rotated co-ordinate system whose positive 2 -axis passes through 
P. We shall find it convenient to introduce rf and rg as variables 
in place of / and g , as 

I® _ f l i ±\ f 
r dr ~V + dr) J ' 

If we wish to avoid the explicit appearance of i in the equations, 
we may write 



rf == v -p iw, rg = v — iw 
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and obtain, finally, the fundamental equations 


rr dw k A 

Uv — -^ — m o v — -w = 0, 
Uw + ^ m 0 w — = 0. 


III. Spherical Harmonics with Spin. 


(7.8) 


Let f(r), g(r) be a solution of equations (7.7) ; then in the 
rotated co-ordinate system 

0i = / • P> 0i = g • p; 02 — / ■ * , 02 = g- r 
where the factors p, r are constants independent of r. On 
returning to the original co-ordinate system each of the pairs 
0ii 0s > 0n 02 undergoes the transformation o- associated with 
the rotation 5. Consequently 

01 = fPi + gr 1 0i' = gPi + /*1 / ? 

02 = fPt + gr* 02 = g/>2 4- /T a V ' ' 

in which / and g depend only on r, and the factors />, r only on 
direction, i.e. on the spherical co-ordinates 6 , <f> introduced by 
setting 

# -4- iy = r sin 0 s = r cos 6 ; 


the coefficients in (7.9) must further satisfy the conditions 

Pi(l — cos 0) — p 2 sin 0^“^ as 0, (7.10) 

Tj^l + cos 0) + r 2 sin 0 £" ** = 0. 

On substituting the expression for £ in polar co-ordinates 
[II, (4.10)] into the Dirac conservation law, we are led, with 
the aid of (7.9) and (7.10), to the differential equations 

sin + i~f + k(l + cos 0 )r x = 0, 

7 7 (7.11) 

sin 0 — i-~r — fe(l — cos 0)p L = 0. 

We have thereby accomplished the transformation of the Dirac 
wave equation into polar co-ordinates. (7.9) corresponds to the 
substitution ift = f(r) Y x of the scalar theory ; in place of the 
single factor f depending only on the distance r we have here the 
pair f g and in place of the surface harmonic Y x depending 
only on the direction we h^ve the matrix 


Pl 

Tl 

Pi 

Tl 
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The equations (7.11), together with the conditions (7.10), define 
the “ surface harmonics with spin of order k ** ; they are quite 
independent of the potential 0, The characteristic values E of 
the equations (7.7) or (7.8) are the energy levels associated with 
quantum number k . 

As in the theory of the ordinary spherical harmonics, we 
here again seek out those spherical harmonics with spin which 
contain the meridian angle only in the multiplicative factor e im ^ : 

Pi = e im* ( s in d)~ m • P, Tj == e iw + (sin 6)~ m • Q. (7.12) 

Substituting these expressions in (7.11) and taking z = cos 6 as 
the independent variable, we find 


(1 ~ z) Tz = “ mP + kQ > 
(1 + z) d £ = mQ — kP. 


(7.13) 


We denote the solutions P, Q of these equations which lead to 
non-singular functions p, r on the sphere more precisely by 
PjT\ ^ suffices to consider the case k > 0, for (— P, Q) 

is a solution of the equations obtained by changing k into — k : 

mw = - m*), = am au) 


Furthermore, 


dPt») 

dz 




dQW 

dz 


QS m ~ 1 >, 


for the derivatives of P< m ), <2< m ) satisfy the differential equations 
(7.13) with m — 1 in place of m. For m — — k, P ~ 1, Q, = — l 
is a solution which satisfies all continuity requirements on the 
sphere, since the multiplicative factor 

(sin 6)~ m e im<t> = (# — iy)~ m 

is finite for negative m. Consequently we find polynomial 
solutions of (7.13), the degrees of which are 0, 1, • • •, 2k — 1 
corresponding to the values ra = — k ) — k + 1, • * •, & — 1. 
The solution for m = k — 1 is 


P(z) = (1 - »)*-*(! + z)\ Q(z) - (1 + z) k ~ l { 1 - *)*. 

We thus finally obtain the following explicit expressions for the 
spherical harmonics with spin : 

df d'P 

P[ m) (z) = ^{(1 - + *)*}, QTKz) = + ^-’(1 -*)*} 

(7.15) 
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where p — k — 1 — m. They behave very much like the 
ordinary spherical harmonics. The following equations are 
also of importance : 

m- *) = (- 1 r . on*), m- *) « (- ip . (7.ie> 


§ 8. Selection Rules. Fine Structure 

I. Selection Rules. 


In a solution ip defined by (7.9), (7.12) <js lt like p x and Tl> 
contains <f> only in the factor e im * and tp t , like p t and r % only in 
the factor e*( m+ l >* ; correspondingly for ifi{, i/r.). Hence ' 


M, 


M, 


*■ " 7 (” + *)*>• 

*•“7 + 


The s-component of the moment of momentum in the state 
(k, m) is accordingly m + This change in the meaning of 

the quantum number m is to be carefully noted : tn + runs 

*d 

through the values 

^ 2’ ^ 2’ ' ’ ’’ — As + g saa j t j — 1 , • • * ( — y. 


as it should. 

In order to obtain the selection rules for the possible transi- 
tions ( k , m) -*■ (k’, m’) and to obtain the corresponding intensities 
we must calculate the matrix which represents the energy of 
interaction between the atom and radiation in terms of the 
co-ordinate system determined by the characteristic functions 
r defining the quantum states n of the atom. Proceeding 
as in II, § 13, we see from (5.15) that this matrix is 


e£Sf,j>p . 
?-i 


The vector ec<Q here plays the same r61c as 4 there. The in- 
tensities are essentially determined by the dements ©(««'). 
the three components of which are 


Spitin') — 

The selection rules are merely consequences of the fact that 
© is a vector. We first obtain the old result for m and j from 
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considerations involving the proper rotations of space. The 
rule for j asserts that the auxiliary quantum number k may go 
over into 

± (* - l). ±(k + 1). (8.1) 

To the reflection i corresponds the interchange T of the two 
pairs (»/q, i/f 2 ), («Ai, ^ 2 )* In polar co-ordinates this reflection 
consists in the transition from (0, <f>) to ( 7 r ~ 6, tt + </>) ; 0 = cos 0 
is thereby transformed into — z and the factor e im * takes on the 
sign (— l) w . In accordance with (7.15) and the expressions 
for p l} r x ; p 2 , r 2 this results in an interchange of p ly r x with 
possible change of sign, as represented by the substitution 


0 1 


0 1 

1 0 

= (- 1)* -1 

1 0 


and the same for p 2y r 2 . By (7.9) we therefore have for ifj with 
auxiliary quantum number k : 

x, - y, — z) = (— 1)*"V(*. y , z). 

The sub-space 91*. thus has the signature 8 = (— l)* 5 " 1 ; this 
result was derived under the assumption k > 0. On replacing 
k by — k and applying (7.14) we find in place of (7.16) : 

mi-*) = (- i) p+ 1 q ( -M «- *) = (- 1 ) p+i p ( _m 


The signature corresponding to auxiliary quantum number 
— k (k > 0) is accordingly (— 1)* . On setting 


l = — k when k is negative (^j = 


fe — i — Z — - 
^22 


k — 1 when k is positive = fe — - = / + 


( 8 . 2 ) 


both possibilities are included under 8 = (— l) 1 , or we could 
also write 8 = sgn k * (— l) fc ~ l . The only coefficients occurring 
in a proper vector are those corresponding to transitions in 
which the signature is reversed. Our selection rule (8.1) for 
k is thus narrowed down to 


k * - 1, - M + 1. (8.3) 


The following table gives the value of the auxiliary quantum 
number k associated with each possible combination of l and j : 



1 2 3 4 5 • • • 
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II. Transition to the limit c oo. 


In order to return from relativistic to ordinary mechanics 
we must pass to the limit c-+ oo. Before applying this to 

equations (7.8) we must replace 27, v by w 0 + — , cv ; we then 
have, on neglecting ^ in comparison with -jp 



on eliminating w we obtain 




or 


A /d 2 kjkj-Jf 


2 m\dr 2 


+ Uv = 


On introducing Z by (8.2) we have in both cases — 1)=Z(Z+1). 
Hence in the limit terms with the same Z, and therefore those 
with auxiliary quantum numbers k and — k — 1, coincide with 
that one associated with azimuthal quantum number Z in the 
scalar theory of Chapter II. The doublet found in alkali spectra 
— and in general the multiplet structure of spectral lines — is 
accordingly explained as a relativistic phenomenon. 


III.. H, He 4 *, • • •. 

In a Coulomb field with nuclear charge Ze we have 



employing Heaviside units, which are better adapted to a field 

theory. In the following calculations we shall denote the 

oc oc 

multiple of the fine-structure constant simply by a itself, 

and we shall set m 0 c = v 0 . In order to integrate equations 
(7.8) we first perform the substitution 

v = e~P f • F } w = e~& r • G, 

where J3 is a positive constant. Our equations are then 


(s-S f +“ c =^-G+ t) g - 


(8.4) 
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Our method will lead to a solution if we choose the constant j8 
in such a way that the determinant of the linear combinations 
of F and G on the right vanishes : 

G) ~ (?) + p = °. vV - v 2 . (8.5) 

We new seek a power series solution 

where the exponent jx begins with an initial value [x a and runs 
through the values /x 0 , /x 0 d* 1, /x 0 -)- 2, • • •. On substituting 
these in (8.4) we obtain the recursion formulae 

0* + ^ ~ a - 7W1 + ]8 fv-i, 

v (8.6) 

a^+ { f x—k)a li = fi b^ v 

The initial exponent /i = /x 0 is determined by the fact that the 
determinant of the coefficients of on the left must vanish 

for this value of the index : 

[x 2 — k* + a 2 = 0 ; ix 0 — Vk % — a 2 . 

Because of the manner in which £ was determined in (8.5) there 
exists a linear relation, with coefficients ~ + _8 between the 

right-hand sides of (8.6) which is satisfied identically in a b u 
Hence for all fx 

(" ■(•' ~) [0* + 0[a bn + (/a — &)aj = 0 

or 

*m[(- d- j)(m A) d~ -+ «/.[^(m — *) — + ?) a == 

(8.7) 

The power series will fireafe off with the term with exponent /x 
if on replacing a^_ x , b^-i by iv the right-hand side of (8.6) is 
made to vanish. The condition for this is that 

PK+(-- V j) a » = 0-, (8.8) 

it will be satisfied in virtue of (8.7) if the determinant of the 
coefficients in these two equations vanishes : 

G - ?)[(; + ?)«• + *> + ■/•]-*[* -*)-(( + 5)*] - • 
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or by (8.5) 





/f 

a* 


Since the exponent /x with which the series break off must be 
of the form /xq + n, where n is a positive integer, we obtain the 
fine structure formula 


- v>4-“ “ « ( ” + Vk ‘ ~ *' > 


(8.9) 


The solution of our differential equations, for the char- 
acteristic values v = cE defined by (8.9), is of the form 

e -P r . r Mo. (polynomial of degree n in r) 


and satisfies the condition that the spatial integral of |^r| 2 con- 
verge in the neighbourhood of the singular points r — 0 , oo. 
These E consequently constitute the discrete term spectrum of an 
ion with nuclear charge Ze and having but one electron outside 
the nucleus. If we neglect the small constant a in comparison 
with k , E depends only onw + \k\. This fine structure formula 
further tells us that the two terms with auxiliary quantum 
numbers k and — fe, or the two terms with the same j and for 

which l = j ± exactly coincide. That this is in fact found 

to be the case has already been mentioned in § 4. Equation 
(8.9) has had a remarkable history. It was first derived on the 
basis of the older quantum theory by Sommerfeld and, at about 
the same time, verified by the experiments of Paschen ; it was 
perhaps the greatest triumph of that theory, next to Bohr's 
explanation of the Balmer series and his calculation of the 
Rydberg number from universal atomic constants. The new 
quantum theory at first destroyed this beautiful agreement, 
as in its scalar form it led to (8.9) with the ha If -integral quantum 
number j in place of the integral \k\. So mmerf eld’s original 
formula was only completely re-established with the advent 
of the Dirac theory here discussed. The quantum number k ) 
which was used in the older quantum mechanics in place of l 
and which may assume the value 0, has also re-appeared and 
is now supplied with a sign. But on the other hand, the number 
of components in the fine structure is now greater than in 
Sommerfeld’s theory, as in addition to the transitions k-+k— 1 , 
k + 1 we may now also have k -> — k ; this addition is also in 
agreement with experiment. 
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Our conclusion that (8.8) was to be satisfied in virtue of 
equation (8.7) for the unknowns a ^ b ^ assuming that the deter- 
minant of the two equations vanished, fails when both coefficients 
of equation (8.6) are zero : 

v + __ __ a _ \i — k 

eft ft k a 

It follows from this that then p, = Vfe 2 — a 2 , orn = 0, and that 

ft -(- k < 0, or k < 0.. There actually exist no terms n = 0, 

k = — 1, — 2, • • \ For the coefficients a ^ b * of the beginning 
term in the corresponding solution, which is at the same time 
the end term, would by (8.6), (8.8) necessarily satisfy the equations 

{jJ- + k)b„ - a 0, aZv -+•(/* — k)a u = 0, jS b,, + V \ = 0 

* c 

or 

f — — = N ) — a — It ~~ ^ 1 — v o 

\ ^ / ft + k a cf3 ’ 

and this is impossible because of the condition |y| < v 0 . 16 

In accordance with the foregoing we may describe ^ normal 
state of the hydrogen atom ; n = 0, k = 1 (Z = 0), as follows. 
We take the quantum number m, which may assume either of 
the values 0, — 1, to be 0. Let a = 0-532 A. be the radius of 
the first Bohr orbit and a = 7-29 • 10“ 3 the fine-structure con- 
stant. fa, if / 2 ; fa', iff are obtained by multiplying the radial 
function 

A (r) = e~ r l a • r s/r=T*-i 

with the factors 

(1 + V l — qc 2 ) + za cos 0, za sin Be'* fa, fa 
(1 + Vl — a 2 ) — za cos 9 , — za sin 9 e l * fa, fa. 

We find from these expressions that the probability density if/ if/ 
is distributed spherical-symmetrically in accordance with the 
law 

p = [A(r)] 2 . 

The normalization is here not chosen in such a way that the 
integral of p over all space is unity ; it is actually 

+ 2v / r=^2). 

We have already seen that in a certain sense the probability 
density multiplied by — e represents the distribution of charge 
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in the atom. Considering the probability current as deter- 
mining the convection of this continuous charge distribution p t 
we find that it represents a circulation about the z - axis with 
velocity a c sin 8 (a c is the velocity of the electron in the first 
Bohr orbit on the older theory). On giving the axis of rotation 
all possible directions iff runs through the 2-parameter family 
of characteristic solutions for which n = 0, k = 1 ; we may 
take as a basis for this family of solutions the above (m = 0) 
and that for which m = — 1, representing a circulation in 
the opposite direction. 

C. The Permutation Group 

§ 9. Resonance between Equivalent Individuals 

The Hermitian forms Q, which represent in system-space all 
possible physical quantities of a given system, constitute a 
totality E within which addition and multiplication is defined. 
If 2 were reducible we could choose our co-ordinate system in 
system-space in such a way that all Q would be simultaneously 
completely reduced ; these individual parts into which the whole 
would be divisible would then each constitute solutions of the 
quantum problem which were merely accidentally joined to- 
gether to form the given solution. In accordance with the 
fundamental Aristotelian postulate of “ nihil frustra ” Nature 
could hardly be expected to indulge in such a superfluous luxury. 
Hence we propose the thesis that E is an irreducible system . On 
introducing as fundamental quantities the canonical variables 
as in II, § 11, this assumption contains the requirement that it be 
impossible to choose co-ordinates in system-space in such a way 
that the 2 f matrices q Xi • • •, ; p lt • • •, p f are simultaneously 

completely reduced. This postulate is to be added to the Heisenberg 
commutation rules as an essential supplement . 

In accordance with Burnside’s theorem [III, § 10], which 
we carry over without scruple from spaces with a finite number 
of dimensions to those with infinitely many, the irreducibility 
postulate allows us to assert that there can exist no linear 
homogeneous relation tr(AQ) = 0 between the components of 
Q which is satisfied for all Q. Since in the domain of the Q's 
not only is multiplication possible — as presupposed in Burnside's 
theorem — but also addition, we arrive at the conclusion that all 
Hermitian matrices in system-space are contained in E. It is 
perhaps desirable to express our requirement directly in the 
form : any Hermitian form represents a physical quantity of 
the system. In accordance with II, § 7 there is associated with 
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each statistical ensemble a positive definite Hermitian form A 
in such a way that tr(AQ) is the expectation of the quantity 
represented by Q. Burnside’s theorem asserts that the equation 

tr (AQ) = tr (A'Q) 

can be satisfied for all Q only if A = A\ or it is impossible to 
distinguish between the two statistical aggregates represented by 
the positive definite Hermitian forms only if A = A\ In particular 
it follows from this that the states represented by two rays in 
system space are physically different if the two rays are distinct ; 
this was to be expected, or even required, from the outset! 
These consequences show the naturalness and cogency of the 
irreducibility postulate, from which it can conversely be deduced. 

The states of physical entities 1 which are fully equivalent , as, 
for example, the electrons in an atom, are to be represented by 
vectors £ ===== (x t ) or rays in the same system-space 9L If two 
such individuals unite to form a single physical system 1 2 the 
vectors of the corresponding system-space x 91 = 9t 2 are, 
in accordance with the general rule of X -multiplication, the 
tensors (x ik ) of order two. But, by III, § 5, 9t 2 is reducible into 
two independent sub-spaces {5R 2 } and [9t 2 ], the space of anti- 
symmetric and the space of symmetric tensors of 2nd order. 
Physical quantities Q of / 2 have only an objective physical 
significance if they depend symmetrically on the two individuals. 
This requirement is expressed in terms of the elements of the 
Hermitian form 

Q = Uq iJC} Vh . x ik x { > v 
by the symmetry condition 


Qki, k'i' = ( 9 . 1 ) 

On reducing (. x ik ) into its anti-symmetric and its symmetric 
parts, 

x ik = x{ik} + x(ik) (9.2) 

Q is reduced, in virtue of (9.1), into two Hermitian forms in 
x{ik } and x{ik) respectively. For on substituting (9.2) into Q 
we obtain four terms : those in which {fR 2 }, [9t 2 ] intersect them- 
selves, and the two in which {9t 2 } intersects [?R 2 ] or conversely. 
These last two then vanish, for if we interchange the dummy 
indices i with k, i' with k! in 

[0] = x{ik}x(i'k’) 

and then replace 

x{ki), x(k'i’) by q tk , n ; — x{ik}, x(i'k') 
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we find [Q] = — [Q], or [<2J = 0. The totality of Hermitian 
forms Q which represent the quantities of 7 2 depending sym- 
metrically on the two individuals is therefore not irreducible ; it 
can be reduced in accordance with the decomposition 

0i 2 = {9*2} + PR2] (9.3) 

of the space 0ft 2 . 

In particular, every possible interaction between the two 
individuals depends symmetrically on them, even when other 
physical elements, such as a radiation field, are also involved. 
Hence if 7 2 is at any time in a state contained in one of the 
sub-spaces {9ft 2 } or [0ft 2 ] it is for all time impossible to get it out 
of this sub-space by any influence whatsoever . Again, we expect 
Nature to make use of but one of these sub-spaces, but the 
irreducibility postulate offers us no clue as to which one she 
has decided on. 

Take as co-ordinates in the system space 01 of the individual 
I the principal axes e< of the energy associated with the char- 
acteristic numbers E+ Disregarding the interaction between 
the two individuals for the moment, the system 7 2 has as energy 
levels Ei + E k with characteristic vectors e< X e fc = e ik ; each 
characteristic number of the type E x + E 2 appears twice, and 
the corresponding characteristic space is spanned by the vectors 
e 12 and e 21 . On introducing the interaction as a small per- 
turbation the two states e 12 and e 21 are in resonance with each 
other. Denoting the components of the total Hamiltonian 
function by H(ik, i'k'), the transformation of the sub-matrix 

77(1 2, 1 2) 77(1 2, 2 1) [I 
77(2 1, 1 2) 77(2 1, 2 1) || 

to principal axes, as required by perturbation theory, can in 
the present case be performed in a manner which is universally 
valid ; we need only to replace the fundamental vectors e 12 , e*j by 

^=(e 12 — e 21 ), y=(e 12 + c 21 ). (9.4) 

Denoting 77(1 2, 1 2) = 77(2 1, 2 1) by hv and the numbers 
77(1 2, 2 1) = 77(2 1, 1 2^ which must be real in virtue of the 
condition 77(1 2, 2 1) = 77(2 1, 1 2) of Hermitian symmetry, by 
Aa, the resonance equations become 

1 dx^ 
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from which it follows that 

i{v — ct)(x lt — x n ), 

d(%i2 "4" ^ 41 ) v 1 \/ 1 \ 

= — i(v + a.)(x 12 + x 21 ). 

Taking as initial conditions x 12 = 1 , x n = 0 for t = 0, we find 

= *12 + *21 = ; (9.5) 

| *12 | 2 = cos 2 | *21 | a = sin 2 «f. 

We see from this how the two states e 12 , e 21 alternate back and 

2 tt 

forth with the beat period — , whereas the components (9.5) 

along the axes (9.4) have always the same constant absolute 
magnitudes. 

The only characteristic numbers associated with the system 
space {9i 2 } are those of the type E x + E % each of which appears 
exactly once, but the sub-space [9ft 2 ] has simple characteristic 
numbers of the type 2 E x in addition to these. Hence if Nature 
decides in favour of {9ft 2 } both individuals can never be sim- 
ultaneously in the same quantum state with energy E x — assum- 
ing this energy level for the individual system is non-degenerate. 
That E x + E 2 occurs only once in {9ft 2 } and only once in [9t 2 ] 
means : the possibility that one of the identical twins Mike 
and Ike is in the quantum state E x and the other in the quantum 
state E 2 does not include two differentiable cases which are 
permuted on permuting Mike and Ike ; it is impossible for 
either of these individuals to retain his identity so that one of 
them will always be able to say “ I’m Mike ” and the other 
“ I’m Ike.” Even in principle one cannot demand an alibi 
of an electron ! In this way the Leibnizian principle of coin - 
cidentia indiscernibilium holds in quantum mechanics. 17 

On passing from 2 to / equivalent individuals I it is not so 
easy to reduce the representation (c :)/ of the complete linear or of 
the unitary group in system-space 9t into its irreducible con- 
stituents ; we shall go into this matter in the last chapter. 
Nevertheless we know from III, § 5, that the anti-symmetric 
and the symmetric tensors of order / with components 

x{k x k 2 * • • k f ), x(k x k 2 • • • k f ), 

respectively, each yield such an irreducible representation. 
A physical quantity Q of the total system If which depends 
symmetrically on all / individuals will be represented by an 
Hermitian operator Q, the coefficients q{k x k 2 • * * k f ; k[k 2 * * • k }) 


d(x X2 x 21 ) 
dt 
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of which are unchanged on subjecting k x k 2 • • * k f and k’ x k' 2 * • • hx 
simultaneously to the same permutation. It is evident that such, 
an operator always sends an anti-symmetric tensor x{k 1 k 2 * * * k/\ 
into an anti-symmetric tensor %' \ 

•••&,} = 2q(k x k : 8 ‘ • k,; k\k'i • • • k' f )x{k[k 2 • • • k' f }. 

Hence the sub-space {9?/} of anti-symmetric tensors is reduced 
out of the system-space W of V , determined in accordance with 
the general rule of X -multiplication, in such a way that if 1 f 
is ever in the system space {SR/} it remains there forever, regard- 
less of what influences may act upon it. The sub-space [SR-/] 
of all symmetric tensors x{k) of order J can similarly be separated 
out of $R/. The energy level E l + E 2 + ■ • • + E f) which is 
/!-fold degenerate in SR/, appears in {SR/} as a simple level. Only- 
characteristic numbers of this type appear in {SR/}, but the 
characteristic numbers of [SR/] are all numbers which can be 
obtained by summation of / distinct or non-distinct energies E. 

If the system space is w-dimensional, {SR/} is only possible 
if / ^ n. If E is an n-fold energy level of the individual / then 
the quantum states with energy E constitute an w-dimensional 
sub-space SR(fi). If it should happen that only {SR/} is realized 
in Nature, then in view of the foregoing it would be impossible 
to have more than n individuals of the system // in the Quantum 
state E. 

The reduction of SR/ to {SR/} or [SR/] involves relationships 
which frustrate any attempt at description in terms of our 
old intuitive pictures with their orbits and billiard-ball electrons. 
But the difficulty enters already with the general composition 
rule, according to which the manifold of possible pure states 
of a system composed of two parts is much greater than the 
manifold of combinations in which each of the partial systems 
is itself in a pure state. 


§ 10. The Pauli Exclusion Principle and the Structure 
of the Periodic Table 

One of the most fundamental facts of Nature, the ordering of 
the chemical elements in the periodic table, can be understood 
only with the help of these considerations. We go from one 
atorn to the following, which we denote by A, in two steps : 
he first is preparatory and consists in increasing the charge 
on the nucleus by 1, and the second and final step consists fn 
adding an electron to the ion A+ so obtained. To obtain the 

SX£ a °m ^- thl \ additiona l electron m ust be bound as 
tightly as possible, i.e. the energy of the total system A must be 
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a minimum. If we disregard the mutual perturbations of the 
electrons for a moment, although they may be very considerable, 
we might expect to find every electron in an unexcited atom in 
the lowest energy level, i.e. with principal quantum number n = 1. 
But instead we find the following : The 1 electron of H and the 
2 electrons of He are in the Is orbit, i.e. they are in the quantum 
state n = 1, l = 0. But the next 2 electrons, which are added 
in going over to Li, Be, are in a 2s orbit, and the additional 6, the 
addition of each of which gives rise to one of the elements from 
B to Ne, enter the 2 p orbit. Then follow Na, Mg, each with a 
new electron in the 3s state, the elements from A1 to A, the 
additional electrons entering the 3 p orbit, etc. These facts 
are readily seen on writing the wave number of the lowest 
S term in the form — Rjn\ ; in H, He, Li the “ effective 
quantum number ” rc* has the values TOO. 074, L59. That 

sinks on going from H to He is understandable in view of 
the “ screening ” effect of the original electron on the new one. 
We should expect that if the next electron also went into the 
orbit n — 1 the corresponding value of w* would be something 
like 0’59, but we find instead a number which is greater than 
this by unity. The same occurs on going from Be to B or from 
Mg to A1 ; the normal states of these atoms are formed by the 
valence electrons entering 2 p or 3 p orbits because the 2s or 3s 
orbits are already “ occupied,” and if the valence electron is 
raised to an s state by excitation, it can only be raised to one 
for which n ^ 3 or « 2> 4.* Obviously the essential features 
of the regularities expressed in the periodic table depend on this 
mysterious numerus clausus for the various states with principal 
quantum numbers n — 1, 2, • • • and on the fact that in conse- 
quence of this the electrons in the atom are added on in definite 
layers or “shells.” Stated more precisely, in an ns orbit 
(n — 1, 2, • • •) there is room for but 2 electrons, in an np orbit 
(n = 2, 3, • • •) for but 6 ; in general the situation is described 
by Stoner’s rule : there can be at most 2(2/ + 1) electrons in a 
state with quantum numbers n, l. 

On taking into account the duplicity caused by the spin we 
see that this number is exactly the dimensionality of the sub- 
space 9t(n/) in the system-space of a single electron. Neglecting 
the spin perturbation, which is indeed much smaller than the 

* The physical significance of the “ true principal quantum number ” 
n is contained in these considerations : we think of the term in the Hamiltonian 
function which represents the energy of interaction between the various 
electrons as multiplied by a numerical factor A and let A decrease steadily 
from i to o ; this virtual adiabatic process sends each electron into a definite 
hydrogenic orbit with a principal quantum number n, the “ true quantum 
number " of the electron. 
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mutual perturbations of the electrons, the energy level associ- 
ated with this sub-space is 2(2 1 + l)-fold degenerate. This 
degeneracy can be removed by the introduction of the spin 
perturbation and a weak magnetic field ; the energy level is 
then broken up into 2(2 1 + 1) simple components distinguished 
by the quantum numbers 

J = l ±2’ m== 3> 3 ~ * * *» -1- 

Stoner’s rule led Pauli to postulate the exclusion of equivalent 
orbits : it is impossible for two electrons in an atom to be simul- 
taneously in the same quantum state ( n , /, j, m). This shows 
that Sftf is obviously not the system space of the physical system 
V in which / electrons revolve about a fixed nucleus, but that 
the reduction to {ffi} takes place : Nature has decided in favour 
of the reduction to the space of anti- symmetric tensors , at least in 
the case of electrons. In view of the considerations advanced in 
the previous paragraph this principle leads conversely to Stoner’s 
rule. 18 

If the formation of one atom from the preceding one were 
an entirely regular process the occupation of the various states 
would take place in accordance with the following table, the 
lower row of which indicates the number of electrons captured, 
on going from atom to atom, by the orbit immediately above : 

Is; 2s, 2 p] 3s, Bp, id- 4s, ip, id, 4 /; • • • 

2; 2+6; 2 + 6 + 10; 2 + 6 + 10 + 14 ; • • •’ 

This would indeed be the case if we could increase the charge on 
the nuclei by some large fixed amount, for the mutual perturba- 
tions of the electrons could thus be made arbitrarily small in 
comparison with the Coulomb attraction of the nucleus. But 
even a rough calculation shows that these perturbations are 
actually too considerable not to lead to displacements in the 
above table, i.e. to changes in the order in which the various 
shells are filled. For example, after the 3 p shell is filled, which 
is accomplished with A, the next 2 electrons go into 4$ states 
to form K, Ca, and only then do we find electrons entering the 
3d orbits to form Sc, Ti, • • \ For details consult the books 
by Hund ) Pauling and Goudsmit or Ruark and Urey mentioned 
in the Introduction. 

It is not the purpose of this book to report on the extensive 
empirical data of spectroscopy, nor to show how the two main 
principles required to lead beyond the general scheme of quantum 
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mechanics to the interpretation of spectra were wrested from 
this material ; I here refer to the introduction of the inner 
quantum number j in addition to the azimuthal l y or the spinning 
electron, on the one hand, and to the reduction of ffi to {ffi} 
by means of the Pauli exclusion principle on the other. 
Millikan begins his report to the American Philosophical Society 
on “ Recent Developments in Spectroscopy ” [Proc. Am. Phil. 
Soc. 66 , p. 211 ( 1927 )], with the words : “ Never in the history 
of science has a subject sprung so suddenly from a state of com- 
plete obscurity and unintelligibility to a condition of full illu- 
mination and predictability as has the field of spectroscopy 
since the year 1913 .” The theory of groups offers the ap- 
propriate mathematical tool for the description of the order 
thus won. 

The lines of the optical spectrum are caused by quantum 
jumps of the electrons which are most loosely bound. In the 
alkalies Li, Na, K, • • • the one involved is accordingly in the 
state 25, 35, 45, • • \ We also understand why their cores 
Li f , Na + , K% • * • are spherically symmetric, and therefore 
why their spectra may be approximately calculated in terms 
of the motion of an electron in a spherically symmetric field ; 
the real reason behind this is the following. That an electron 
has the quantum numbers n, l means that its state is in a 
sub-space Sftj of A = 2(21 + 1) dimensions. The sub -space 
{% X X * * * X 91 with A factors, as obtained by the anti- 
symmetric reduction of 91*, is 1 -dimensional and the rotation 
group induces in it the 1-dimensional identical representation ; 
i.e. a shell consisting of A electrons in the state n, l acts spherical - 
symmetrically ; its presence does not increase the manifold of 
terms. Hence the “ closedness ” of those elements with which 
a shell is completed ; the rare gases, which precede the alkalies, 
are elements of this kind. But we should also expect Cu, Ag, Au 
to have alkali-like spectra, as they contain but a single electron 
in the 5 state, while all the others are bound more tightly in 
a “ closed ” configuration with an external field which is spheri- 
cally symmetric. The valence of the elements must obviously 
find its explanation in these terms ; indeed, it gave the clues 
which originally led to the discovery of the periodic table. 
But only in recent times have we been able to call on the assist- 
ance of spectra, interpreted and arranged with the aid of atomic 
theory by Bohr and others, and they have verified the principal 
features of the table, while modifying, supplementing and 
improving its details. 

The consequences of the Pauli principle for the term analysis 
of atomic spectra will be discussed in detail in Chapter V, 
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particularly in § 15. We here mention briefly the results for 
the case of 2-electron spectra / = 2. 

Just as the alkalies may be treated as if they were but 
1-electron atoms, in dealing with the alkaline earth metals we 
need only take into account the two most loosely bound electrons 
which occupy an s orbit outside a spherically symmetric closed 
shell. As before, we obtain one singlet and one triplet term 

(»Z, n7 ; L) 

whose total azimuthal quantum number L assumes the values 

L = l + l + V - 1, • • *, 1 1 - /' | 

assuming that the two quantum states (nl), («'/') of the individual 
electrons are distinct. The only difference is that now such 
a term appears only once, whereas before it appeared twice, 
corresponding to a permutation of the electrons. The situation 
is, however, more complicated if (nl) = (n'T). The only singlet 
terms 

[nl ; nl) L) 

which actually occur are those with even L = 0, 2, • • •, 2Z and the 
only triplet terms are those with odd L = 1, 3, • • -, 21 — 1. This 
rule is thoroughly in accord with the empirical data. 

The best-known lines of the spectra are those arising from 
transitions in which only one electron is not in the normal state 
and is jumping between higher energy levels. Hence if one 
of the two electrons (not saying which !) is in the normal state 
ri = n 0 , V = 0 (n 0 = 1, 2, 3, 4, • • • for He, Be, Mg, Ca, • • •) 
we have L = l and the two quantum numbers (n, l) suffice to 
determine the singlets or triplets. The lowest S term (L = 0) 
of the singlet system has the principal quantum number n = «q, 
but there is no such term in the triplet system ; it begins with 
n == n e -j- 1. We find that the lowest S term in such a triplet 
system (which is, as we know, simple), e.g. in the spectrum of 
Mg, actually does lie in the neighbourhood of the second lowest 
S term of the singlet system instead of the lowest. 

§ 11. The Problem of Several Bodies and the Quantiza- 
tion of the Wave Equation 

In this paragraph we depart from our usual terminology 
and denote the number of individuals by n instead of /. We 
first consider more fully the reduction of 5R* to [3ft n ], for we shall 
find that although it does not apply to electrons, it does to 
photons. Let H = ||if^|| be the Hamiltonian function of an 
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individual. The variables n 2 , • • •) of the unitary space 
[SR”] behave like the monomials 


'V M 2 • . . 

At ! At 2 

\/ n x \n 2 \ • * * 


K + n 2 + • • * = n), 


( 11 . 1 ) 


of degree n which are formed from the components x M of an 
arbitrary vector in ; we denote this monomial (11.1), without 
the denominator, by <j f>(n l} n 2) • * •). We shall have occasion 
to use the differentiation formula 


d{%l l * * •) == (wi %" 1 1 * * • dx x ) + (n 2 • • dx 2 ) + * * • . 


In the absence of interaction between the individuals we obtain 
from 

7 d if+IH x ,x, = 0 (11.2) 

the equation 

— I n 2 , • • •) = »,.£#, ,)<£(«, - 1, n 2 , • • •, n fi + 1, • • •) 
1 fi 

+ n % IJHzp n 2 1) ‘ * “i n p + 1, ‘ * *) 
P 

+ • 

In the sum on the right <f>(n x — 1, n 2> * •' *, + 1, • • •) is to 

be interpreted as <t>(n x , n 2 , • • •) for j3 = 1 ; similarly for the 
term with )3 = 2, etc. We can also write this equation 

“ «a, * • •) = 2X * £(»x, ‘ * *) 

1 a 

4- IX • </>(• •*,»« — 1, •••,%+ l, • • •)• 

a 


On introducing the binomial coefficients in accordance with 
(11.1) we obtain as the equations of motion 


1 #(«i, 

i 


Uft 


dt 


_•) 


JL/Wm (X 


+ZVn x (nt,+ l)H ar j,(- ■ ■, 


• K »*, * • •) 

«»— 1, • • «/>+!, ■ • •)• 


These equations are of the form 


( 11 . 3 ) 


Yi + H * = °. 


H = UH„^-q a p 

«, P 


( 11 . 4 ) 
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where the matrices -r]^ are defined by 

i if = «i» «2 = «*, . • 


y<*x( n i, n t , 


« 1 . » 2 , • • *) 


K 

\0 


otherwise 


• (H.5) 


and for a =j= ^3 

^(»i, «2. * • • ; «i, ’ ') = {J /na(W/> + ^ (H-50 

where the first alternative holds when all n' = n with the ex- 
ception of 1, + 1 and the second in 

all other cases. H is, as it should be, an Hermitian matrix. 
If H is in diagonal form the fundamental vectors forming our 
co-ordinate system are the quantum states of the various in- 
dividuals ; |i//(w 1 , n 2 , • • *)| 2 is then the probability that there 
are simultaneously n x individuals in the first quantum state, 
n 2 in the second, etc. On reduction from to [9i n ] it becomes 
impossible to identify the individuals as Mike, Ike, • • • and we 
therefore may not ask for the probability that Mike is in the 
a th state, Ike is in the /7 th , • • \ If we have in addition to H a 
perturbation s W affecting the individuals (and symmetric with 
respect to these individuals), then equation (11.3) governs the 
change of the probabilities )0(n lf w 2 , • • *)| 2 in time. 

The Hamiltonian function H reminds us of the one which we 
obtained in Chapter II, § 13 by quantizing Maxwell's equations ; 
there the individuals were photons. Maxwell's equations are 
to be considered as the quantum-theoretical wave equations of 
an individual photon. If we replace the photon by an individual 
whose state (x a ) varies in accordance with equation (11.2) we 
are led to a new way of treating the problem of several bodies, 
which we call the 41 method of second quantization ” in contrast 
to the 44 method of composition ” or 44 X -multiplication ” de- 
veloped in Chapter II, § 10. In this we consider (11.2) as the 
classical equations of motion of a physical system whose canonical 
variables are the real and imaginary parts q v p * of x «, and as 
such subject them to the process of quantization. 19 We here 
tie on to the development given in Chapter II, § 11. Introduce 
the complex quantities 

X * = = “ *P«) 

into the Hamiltonian function H as independent variables in 
place of q X) p x ; the Hamiltonian equations are then 

d%* = __ .iH dx* __ .bH 

dt 1 bx a y dt 1 bx* 


( 11 . 6 ) 
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In order that (11.2) may be considered as the classical equations 
of motion of a system with infinitely many degrees of freedom, 
in accordance with our programme, they must be of the form 
(11.6). But this is in fact the case ; the Hamiltonian function 
is then 

H = ZH^Xp. 

<X,P 


In quantizing x are to be replaced by Hermitian conjugate 
matrices x*, x a which satisfy the following commutation rules : 


x * Xp ~ Xp X* = 0, x* 3tp — Xp x* = 0, 

xx* xox S _/l (« = « 

“ 0 0 “ ~ B * 0 ~ \0 (x 4= j8). J 

The Hamiltonian function H then becomes the matrix 


(11.7) 


H = ZH a p x a Xp ; (11.8) 

<x,fl 

if // is in diagonal form then 


H Z^at X x X a . 

a 


We are here dealing with an infinite set of oscillators, the in- 
dividual members of which are distinguished by the index a ; 
the energy of the a th is given in terms of the complex co-ordinates 
**, by E a x a x a . 

The quantum theory of a single oscillator as developed in 
II, § 3 gives us as the irreducible solution of 


XX — XX = 1, 


where x, x are two Hermitian conjugate matrices normalized 
in such a way that the energy xx is in diagonal form, the matrices 

x i n j n + 1) = Vn + 1, x(n, n — 1) = Vn ; xx(n, n) = n , 

all other components vanishing ; the quantum number n assumes 
the values 0, 1, 2, • • \ From this we obtain the solution of 
(11.7) by composition : 

I Vw -4- 1 ^ n ' = n 

x J\ n i> n 2 ) * * * ; n[ f n 2 , * • •) = i * ’ except ri a = n* f 1, 

(o otherwise ; 


x *( n i, n 2 , 


n i, n 2 , 



if all n' =n 
except n* = n a — 1 , 
otherwise. 


The products x*x« are of course in diagonal form; x^Xp is the 
matrix introduced above, and (11.8) coincides with (11.4) : 
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the method of second quantization leads to the same result as the 
method of composition supplemented by the “ symmetric reduction ” 
of*8i n to [ 8 t n ]. But now the number 

ft ± + ^2 + * * ' = n 

of individuals is not prescribed ; however H is reduced into 
sub-matrices in accordance with the various values of n, for 
all components H (n^ • * • ; n[n 2 • • •) for which w' 4* + 
• • * 4 : Wj + n 2 + • * * vanish. The total number of photons 
is not conserved, and to this extent Maxwell’s equations do not 
fit completely into the quantum-theoretical picture — unless we 
wish to consider “ non-existence ” as a particular quantum 
state of the photon. 

The method of composition remains applicable in the presence 
of interaction between the individuals, provided it is an in- 
stantaneous action at a distance determined by the simultaneous 
values of the canonical variables of the various individuals. 
But it breaks down when, as in the theory of relativity, account 
is taken of the finite velocity of propagation, which led to the 
introduction of continuous fields in the classical theories. The 
difficulty arises from the fact that the wave function \fj must 
contain the one time t as argument in addition to the spatial 
co-ordinates of each particle, whereas the theory of relativity 
requires that the proper time of each particle appear as argu- 
ment in as well as the spatial co-ordinates. The method of 
second quantization shows its superiority in dealing with such 
problems. 

As we have seen, the method of second quantization in 
accordance with Heisenberg’s commutation rules is equivalent 
to a reduction of the system space 9 i n to [SR n ]. Since we have 
seen in II, § 13 that this leads to the correct laws of radiation 
phenomena, we must conclude that the behaviour of photons 
corresponds to this reduction. But in the case of electrons the 
reduction is to the space {S?*}, and we must now investigate 
to what kind of quantization this corresponds.^® The vectors 
of the unitary space {9t n } are the anti-symmetric tensors with 
components 

*2, • * *, a n} ^ K, ‘ •, Xj (11.9) 

in the space 9?, where the one row in the determinant stands for 
the n rows formed in the same manner from n vectors j = j^ 1 ), 
? » S (n) We can obtain the totality of linearly 

independent components by restricting the indices by the 
condition 


a i < a 2 < • ■ * < a n . 


( 11 . 10 ) 
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We now denote (11.9) by i/j ( n h n 2j * * *), where n a = 1 or 0 
according to whether a appears in the set of indices a Xj a 2 , • * •, 
a n or not ; these quantum numbers n a may thus only assume 
one of two values. On replacing a x = a in (11.9) by an index 
j8 =)= oc, (11.9) vanishes if jS is equal to one of the remaining 
indices oc 2 , * ■ *, a n ; if is different from a 2 , * • •, a n it becomes 

• • • «»} = ± <A(%, • • •, — 1, • • % rif, + 1, • • •), 

the sign ± 1 being (— l) r where r is the number of indices in 
the set a 2j • • a n lying between oc and /3 : 

r = 

A 

where the sum is extended over all indices A between a and j8. 
We again obtain equations of the form (11.4) ; (11.5) is then 

valid as it stands but (11.5') is to be replaced by 

w 2> • • • ; *4 • • •) = ± 1 or 0, 

where the first alternative applies to the case in which all n' — n 
except w* = 1, = 0 ; n p — 0, ri p = 1, the sign being again 

determined in accordance with the above rule. On writing 
a matrix ||a(nn')|| in the form 

a ( 0 0 ) a ( 0 1 ) 
a(l 0) a( 1 1) 

and introducing the abbreviations 


1 

o 

r 

hi 

0 

0 

i 

o 

- 1 


we may write 

^=1 X 1 X • • • X ^ X 1 X 1 X * * •, 

^=1x1 X • • • X Xl'X • • • Xl'X qJ xlx • • • («=#£), 

where the matrix that is written explicitly in the first equation 
is in the « th place and those in the second in the a th and )3 th 
places respectively. We must now attempt to write these 
matrices in the form x a x fi ; this can in fact be accomplished by 
taking 

X a = V X 1' x • • • X r X ? I X 1 X 1 X • • •, I 

uu t (11.11) 

5. = l'xl'x.- ■ • Xl'X j q X 1 X 1 X • • *,J 
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the small explicit matrices being in the « th place. x Xl x a are 
Hermitian conjugates, and H can now be written in terms of 
them in the desired form (11.8). Instead of the commutation 
rules (11.7) we now have 

x a xp -j- Xp x x = 0, x a xp -(■ x p X x = 0, x x Xp -f- xp x x = (1 1. 12) 

(11.1) is the irreducible solution of these equations by a pair of 
Hermitian conjugate matrices x X) x x which are so normalized 
that x x x x is a diagonal matrix. 

In order to show that the equations (11.4) for the vector tft 
in system-space yield the Hamiltonian equations (11.6) for the 
forms 

x x = 2X(« ; ri) 4>(n) ifi(n') and x„, 
we must prove that the formula 

x a H — H x a = ~ 

oX# 

employed in II, § 11, holds here as well. We find that it does 
not hold for an arbitrary polynomial H in x„, x^ but that it 
does for even polynomials in general and so in particular for 
the Hermitian form (11.8). For we have, for example, 

-*1 X x Xp = 8,„ Xp — x a x ± Xp = 8 lct Xp + X x Xp x lt 

whence 

X x • X A Xp — X m Xff X x =z h lct X Pl X x H — Hx 1 =s HHjpXf}. 

$ 

On introducing real quantities, i.e. Hermitian forms, p a , q # 
by 

x * = \ (<1« + iP*), *« = l (?« - ip*) 

and denoting the set p„ q t ; p*, q t ; • • • straight through by 
Pi, Pt, Pa, Pt, ' ' * we obtain the relations 

Pi = 1, p*pfi + PeP* = 0 («*$ (11.13) 

The pc are not only Hermitian but unitary as well, as can be 
seen from the first of these equations or directly. Here again we 
meet the matrices 


0 1 


0 -i 

1 0 

f 

i 0 


which occurred in connection with the spinning electron. 
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We have thus discovered the correct way to quantize the 
field equations defining electron waves and matter waves. 
Here again we find, as in the case of the spinning electron, that 
quantum kinematics is not to be restricted by the assumption 
of Heisenberg’s specialized commutation rules. 

§ 12. Quantization of the Maxwell-Dirac Field 
Equations 21 

The field laws arise from a Hamiltonian principle which is 
analogous to the Hamiltonian principle of classical mechanics. 
This latter is expressed in terms of a Lagrangian function L 
which depends on the positional co-ordinates q { and their de- 
rivatives qi with respect to time, and asserts that the first 
variation of 

f ( 12 . 1 ) 

vanishes when the q { are assigned arbitrary infinitesimal incre- 
ments 8qt which vanish outside a certain finite time interval. 
This principal yields, on integration by parts, the differential 
equations 

Ti + L <~° ’ ith t , > = - U = (12.2) 

Defining 

H = L -f Z4iPi 

i 

and noting that 

hL = ELM - IPiMi 

i i 

we obtain for the differential of H the expression 
8 H = ELMi + Eittyt. 

i i 

Expressing H as a function of the q { and the generalized momenta 
pi associated with them, we have 

IH __ - . 

7 > qi ~ Li ' * Pi - q< 

and by (12.2) these are just the Hamiltonian canonical equations 

dq { dpi __ 7)H 

dt ~~ Ipi dt ~~ Iqi 

In quantum theory the q it pi are operators satisfying Heisen- 
berg’s commutation rules. 
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This reasoning can be carried over without difficulty to the 
case of a continuum, as appears in field theories. On replacing 
for the moment the 3-dimensional space by the 1-dimensional 
interval 0 ^ x g 1 described by the co-ordinate x and assuming, 
for the sake of simplicity, that only one state function q = q(x , t) 
is involved, the integral (12.1) is then to be replaced by 



Naturally L may depend on the spatial derivative or even 

higher derivatives, in addition to q . The continuous variable 
x takes the place of the index i and the Lagrangian function, in 


the sense of (12.1), is now the integral J L{q, q)dx with respect to 

o 

the spatial variable instead of L itself. We first replace the 
continuum by a discrete set of equidistant points defined by 

Ax = - (z = 0, 1, • • •, n — 1). The differential quotients with 
n 

respect to x are naturally to be replaced by difference quotients 
with the difference Ax = I n, and the integrals become sums. 
In accordance with the outline above we must now set 


Pi=- 


g) . 

tq 


A*, 


calculated at the point x = i/it. For the continuum we have 
analogously to set 


P = 

and H is to be defined by 


4) 

*q ’ 


H 



The commutation rules which are satisfied by q, p in quantum 
mechanics cause some trouble. As long as we employ the 
discrete set of points in place of the continuum they are 


?(*) P( X ') — P( X ') q(x) = — 1 • S^r 

Ax 

where x, x' run independently through the set i/n and 8 xa , is 
1 or 0 according as x' coincides with x or not. For fixed x' 

~ • 8 XX , = S(x - x') 
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is a function of % which vanishes for all values of the argument 
other than %' and is there so large that the sum S8(x — x') * A# 

has the value 1. In dealing with the continuum we therefore 
introduce with Dirac a function h(x — x') which vanishes at 
all points x =%= x' and is so large at the point %' that its integral 
has the value 1 (cf. I, § 7). Of course there exists no such 
function, but it can be “ arbitrarily closely approximated ” by 
a function which vanishes everywhere except in a very small 
interval about x' and assumes very large values within this 
interval. Only in this sense can we perform the passage to 
the limit A# = 0 and write the commutation rules symbolically 
in the form 

q(x) p(x') — p(x') q(x) = i 8(x — x'). (12.3) 

A good illustration of the mathematical interpretation of 
this pathological function S(# — x f ) arises in the theory of ortho- 
gonal sets of functions <j> i {x) 1 for with its aid the completeness 
condition may be formulated 

zU*) &(*') = s ( x — x ')- 
% 

This is literally correct as long as % only runs through a discrete 
set of points, but the rigorous mathematical formulation for 
the case of a continuum is given by 

1 1 » 3 

lim f f £ $i(x) <f>i(x') • ft(x) v(x') dx dx f = \u{x) v(x) dx 

n ->°° 0 0 i - i o 

where u(x), v(x) are any two continuous functions in the interval 
(0, 1). Hence from the more rigorous standpoint (12.3) must 
be replaced by the equation 

ii i 

| ^u(x){q(x) p(x') — p(x') q(x)}v(x') dx dx' = i^u{x) v(x) dx 
oo o 

containing two arbitrary functions u(x), v(x) ; furthermore, it 
is to be noted that the p ) q in the brackets are first to be replaced 
by approximations p( n ), g( w )~ e. g. by the n th partial sum of 
their expansion in terms of orthogonal functions — and the 
passage to the limit n -> oo is to take place after , not before, 
the integration. This interpretation offers a sound mathematical 
method of dealing with the relation (12.3). It is to be emphasized 
that (12.3) refers to two points of space x ) x' at the same moment 
t , i.e. in a section of the world in which t = const. ; the arguments 
of q and p are to be written more precisely as (x, t), {. x' } t) re- 
spectively. 
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On applying this general scheme to the action 

+ + (6.18) 

from which the field equations for the electron and for the electro- 
magnetic field are obtained, we find ourselves faced with a 
difficulty arising from the fact that the Lagrangian function 
does not contain the time derivative of the scalar potential jf 0 , 
for the generalized momentum associated with / 0 then vanishes 
identically and cannot possibly satisfy a commutation relation 
such as (12.3). We avoid this difficulty for the moment by 
utilizing the principle of gauge invariance to remove / 0 from the 
expression of the Lagrangian function by setting it equal to 0 ; 
this device has already been employed in II, § 13. The set of 
independent functions describing the state is then 

<A = W'i, K K &), f = ifi, ft, f 3 ), 

where we have written ^ 3 , in place of The momenta 

associated with these quantities are then found to be : with 

^r p and — E v with /*. The commutation rules which are to be 
applied in quantizing the field equations are accordingly 

UP)MP') + UP'MP) = V S(P-P') Ip, a- 1,2,3,43, (12.4') 
MP)E'(P’) - E„(P')f v {P) = • S(P-P') [*>,? =1,2, 3], (12.4”) 

where P and P f are any two points of the same spatial section 
t =s const. We have here taken account of the fact that the 
quantities $ describing matter are not to satisfy Heisenberg’s 
commutation rules, but are instead to satisfy those obtained 
by replacing the minus sign which occurs in them by a plus 
sign. These rules must be supplemented by the assertion that 
the \ft p satisfy in addition the equations 

up)un + unup) = o, (w.#) 

and the same for ; that the /, at any two points P, P' are 
commutative and the same for the E„ ; and finally that the 
material quantities </<, on the one hand and the electromagnetic 
quantities /„, E v on the other are kinematically independent, 
and that every quantity of the first kind at a point P commutes 
with every quantity of the second kind at any point P' (in the 
same section t — const, of the world). 

As in II, § 13, we again consider the whole system enclosed 
in an insulated and perfectly reflecting cavity which is at rest. 
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In order to describe the electro-magnetic potentials we make 
use of a complete orthogonal set of solutions f of 

Af + v 2 f = 0 (12.6) 

in the cavity, which satisfy the conditions 

div f = 0, f normal 

at the walls. The construction of such a system is readily 
obtained from the Gauss divergence theorem 

J(curl f • curl g + div f • div g + f * Ag )dV 

= J([f, curl g] n + \ n div g) do (n denoting normal component) 

for the vector [f, curl g] + f div g, f and g being two arbitrary 
vector fields. 22 We first determine the scalar functions <f> = cf> K 
which satisfy the equation A <f> + A 2 </> = 0 and vanish on the 
walls, and from them construct the vector fields f A = grad <^ A ; 
these vectors j\ automatically satisfy the conditions above, 
are of course mutually orthogonal and can be normalized in 
accordance with the equation 

J(fx . f y)dV = S AX '[= X^MydV]. 

We also determine a complete normal orthogonal system of 
solutions of (12.6) which are normal to the walls but which 
satisfy the condition div \ v = 0 everywhere, not only at the 
walls. The f A are then orthogonal to these f„ and they con- 
stitute together a complete orthogonal system for vector fields 
in the cavity. We may consequently write 

f = 2<Iv f* + Epx f a | 

= 27**4 (12 ' 7) 

v X ) 

in the section t = const. The f v are vectorial functions of 
position in space and have as values ordinary numbers, whereas 
the p, q are scalar quantum mechanical matrices which are 
independent of position and which satisfy the commutation 
rules 

q v p v — p v q v = i, qx px — p x qx = i ; 

all q commute among themselves and all p among themselves, 
and any p commutes with any q whose index is not the same. 
[These rules are perhaps most readily obtained by solving 
(12.7) for the “ Fourier coefficients ” p } q in terms of integrals 
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of scalar products of f, 6 with j^. and applying the commuta- 
tion rules (12.4).] The energy 

of the electro-magnetic field becomes 

+ *-?') + 

We already know the solution of the commutation rules which 
reduces this expression for the energy to diagonal form. The 
individual components of the vector on which the p, q operate 
are distinguished by means of the quantum numbers iV„, corre- 
sponding to the v, and the values of the continuous variables q 
corresponding to the A. On setting q v = y /(X * Qv, Q * s an 
operator which affects only the index N v in accordance with 
the equations 

Q V {N V , N, - 1) = yj^, Q V (N V , N v + 1) - ; 

all other components, corresponding to transitions N v Nl 
in which Ni is neither N v ± 1, vanish. N„ assumes the integral 
values 0, 1, 2, • • ■ and can be considered as the number of 
photons of the kind v. The momentum px associated with the 
continuous variable qx is, following Schrodinger , represented by 

the operator t The electro-magnetic energy is then in 

diagonal form and, on neglecting the (infinite !) null-point 
energy, multiplies the vector component ( N v ; qx) with 

ZvN v + %Zq\. (12.8) 

We thus see how it happens that the electro-static part, which 
is described by the continuous variable qx, is separated off from 
the part due to the radiation, described by the discrete N v 
giving the number of photons of kind v. 

The $ appear in the part of the energy due to matter only 
in combinations of the form $ p tfi*. Consequently it will be found 
advantageous in dealing with electrons to apply the method 
of composition followed by anti-symmetric reduction ; we have 
shown in the preceding section that this procedure is equivalent 
to quantizing in accordance with the rules (12. 4'). Since the 
electro-magnetic quantities commute with the ^ p , $ p they may 
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here be considered as ordinary numbers. The quantized wave 
equations then refer to a “ vector ’’ g with components 

S Pi • ■ • PfS-P 1 ' ' " ^ >n j i ?•*)> 

where P lt • ■ P n are the positions of the n electrons and 
Pi, ' • •, p n are their spin variables, each of which runs through 
the four values 1, 2, 3, 4. We write z H . , Pj as a column 
consisting of 4" terms ; this z is anti-symmetric with respect 
to a permutation affecting the P r and p r alike. @ (r) = 

Sjs r) , Stf) is the spin vector (5 X , S 2l S 3 ) operating only on the 
r th index p r , T <”> is similarly the operation on the r th index p r 
which interchanges i /q, <jj 2 with >p 3> and grad< r) is the gradient 
with respect to P r . The part of the Hermitian energy operator 
— J 0 in the equation 

JJr o -?og = ° (*o = Ct, H = — cj 0 ) 

which depends only on matter is 

2 (@ <f) , \ b ra d (r) + V*ZQ ■ l(Pr) + J z grad ^(P r ) ■ ~) 

r« i N 1 v oqx/ 

+ m 0 ZT(') (12.9) 

r=l 

and to this must be added the electro-magnetic part (12.8). 

Since we have throughout taken the scalar potential / 0 = 0 
we have lost the equation 

div(£ + p = 0 (12.10) 

arising from the variation of /„. This equation contains no 
derivatives with respect to time, and consequently represents 
a condition on the state of the field at a moment t — const. ; 
we must naturally take it into account. On substituting the 
value of @ from (12.7) we obtain 

Z<p + p — 0 

and on multiplying with <f> x and integrating over the space under 
consideration 

q* — \p<f>xdV = 0. 


From the standpoint of quantum mechanics the left-hand side 
of this equation is an operator D x , and the meaning of the 
equation D x — 0 is that only those vectors j which satisfy the 
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equation D x J = 0 are to be allowed. D x also consists of an 
electrical part q x and a material part 

\p dV = J + <?2 'I’i + & ^3 +■ & dF. 

The operator D x which is to be applied to $ is accordingly 

D x = q x — Z<j> x (P r )- 

f -= 1 

The equations D x % = 0 then assert that all components 
z{P r \ N,; q x ) of % vanish except those for which q x = £<f> x (P r ) ; 
we may therefore write the non-vanishing components as 

4>(Pr J N v ) = z[P r ; N v ; lfr(P f )]. 

r =* 1 

But then 


grad« 0 = grad (r) g + £ grad fa(P r ) ■ ^ 

is exactly the combination which appears in (12.9). 27^ is 

now given by a 

I SHPr) M p *) = Z G{P r , P.) 

r, i = 1 A r,*wl 

where 

G{P,P') = ZUP)UP') 

is the ordinary Green’s function for the cavity. We conse- 
quently obtain the quantum equation 


1 chli 

iw 0 -^ =0 


for i/i, in which the operator 

~ gradM) + m 0 ^>| + 5 £ G(P r , P t ) 

V J A r, I — 1 


+ 2W + V«Z 2j(@W, f,(P,)) . Q,}. (12. 1 1) 

In Dirac's theory 


1 
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is the energy operator for a single free particle, a G(P, P') is 
the classical potential due to the electro-static repulsion be- 
tween two electrons situated at P and P' . The next term 
represents the sum of the energies v of the photons in the various 
frequency states v , and finally the last term represents the 
interaction between photons and electrons by emission and 
absorption. The meaning of each of the terms from which 
the energy operator (12.11) is constructed is thus apparent. 
The quantum theory had previously dealt with fields, such as 
that which binds the electron in hydrogen to the nucleus, in 
a manner entirely different from that with which it treated the 
field of the emitted radiation ; the first was calculated classically 
and purely electro-statically as an action at a distance described 
by the Coulomb potential, whereas the second was broken up 
into discrete photons with the aid of Bohr's frequency condition. 
We have now obtained a theoretical justification for this pro- 
cedure which led to good agreement with experiment. 

Our expression shares with classical electro-dynamics the 
disadvantage that it contains the term G(P r , P r ) representing 
the infinitely large reaction of the r th electron with itself, for 
as we allow P' to approach P, G(P, F) becomes infinite like the 
reciprocal of the distance PF. We should therefore replace 
G(P, P) by the finite P (P, P) where 

P(P, F) = G(P, F) L=, 

V ’ V ’ ' in -PP" 

for this amounts to dropping an infinitely large additive con- 
stant from Jq, r(P } P) represents the effect on an electron at 
P of the field obtained by reflecting the field of P in the walls 
of the cavity. (12,11) shows explicitly how the various terms 
of y 0 depend on the value of the fine-structure constant a ; on 
developing the solution in powers of a we are faced again and 
again with infinitely large terms of the same kind as G(P r , P r ). 
The operator J 0 contains singularities which, at the present 
stage, frustrate all attempts to carry through the theory. We 
may indeed conclude with P. Jordan that the problem of the 
existence of the electron is solved, but that that of its con- 
stitution has as yet eluded us. Our equations further suffer 
from the fundamental disadvantage of the Dirac theory that 
the individual spin variables p r assume i instead of 2 different 
values, 23 

There is, of course, nothing to prevent us from quantizing 
the matter waves in a manner analogous to that applied to 
electro-magnetic waves. We should then develop our quantities 
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describing the material field in a series of characteristic 
functions i/j = (with four components) of the Dirac equation 

(j (©, grad) + m o r)^r + ^ = 0 (12.12) 

which constitute, on imposing appropriate boundary conditions, 
a complete orthogonal system. The general component z of 
the vector j, on which the energy — cJ Q operates, will then depend 
on the quantum number w M , which corresponds to the char- 
acteristic values jjL and which may assume only the values 0 and 
1, and in addition on the numbers N v of photons of the various 
frequencies v and on the continuous variables q x . But then the 
operators D X) which commute among themselves and with J 0} 
are not in diagonal form, and the elimination of q x cannot be 
accomplished as in the above method. 

Instead of introducing a cavity as in the above we may 
employ a rectangular parallelepipedon with the “ boundary 
condition ” that all functions are to be periodic functions whose 
periods are the lengths of the sides of the parallelepipedon. 
We can then introduce running instead of standing waves as 
characteristic functions for the electro-magnetic field; this gives 
rise to a better agreement with the physical picture in which 
a photon corresponds to a homogeneous plane wave. The 
energy and the momenta are then also in diagonal form if we 
neglect the interaction between matter and light. Equation 
(12.10) then causes some difficulty, as its right-hand side 0 
must be replaced by the constant mean value of the charge 
throughout the entire space in order that a periodic solution 
be possible. On taking account of protons in the theory this 
will automatically correct itself, as the total charge will then 
beO. 

The dynamical law allows only those quantum jumps of the 
particles in which one n { M falls from 1 to 0 and another jumps 
at the same time from 0 to 1. Consequently the total number 
of particles Zn U) and therefore the charge, remains fixed ; hence 

that portion of the dynamical laws in which the total number 
is a given finite n is separated off from the remaining portion 
and intercombinations between the two do not arise. Dirac 
has proposed to interpret the presence or the absence of a proton 
in the state of positive energy p as the absence or the presence, 
respectively, of an electron in the corresponding negative energy 
state — / 1 ; our laws will then include protons as well as electrons. 21 
Remembering that the numbers = 0, 1 were at first intro- 
duced merely as an arbitrary index indicating the rows of a 
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matrix, there is nothing to prevent us from replacing the numbers 
n for negative — /x byw~ = l — «_ M , keeping for 

positive fi. The theorem of the conservation of charge is then 

— Zn~ — const, (/x > 0). 

But we thereby alter the content, as well as the notation, of 
the theory ; we are now interested in that part of the dynamical 
equations in which only a finite number of w M with positive y. 
are different from 0 and only a finite number of with negative 
H are different from 1 ! The quantum jump of an electron 
between positive and negative energy levels, which was so un- 
desirable in the Dirac theory as formulated in the previous 
section, now appears as a process in which an electron and a 
proton are simultaneously destroyed and as the inverse process. 
The assumption of such an occurrence, for which our terrestrial 
experiments offer no justification, has long been entertained in 
atrophysics, as it seems otherwise extremely difficult to explain 
the source of the energy emitted by stars. 

However attractive this idea may seem at first, it is certainly 
impossible to hold without introducing other profound modi- 
fications to square our theory with the observed facts. Indeed, 
according to it the mass of a proton should be the same as the 
mass of an electron ; furthermore, no matter how the action 
is chosen (so long as it is invariant under interchange of right 
and left), this hypothesis leads to the essential equivalence of 
positive and negative electricity under all circumstances — even 
on taking the interaction between matter and radiation rigor- 
ously into account. 

Having now quantized the field equations, we must return 
to the question of how the constituents M, M' , F of the action 
behave under the substitutions (6.12), (6.13), (6.14). The first 
two substitutions, which we may call (a) and (b), have exactly 
the same effect as before. But the third substitution (c), 
which sends the components of if over into the components 
of ft or their negative, now affects M and M' differently, for 
di and f> are no longer commutative with respect to multiplica- 
tion — they are, in fact, almost anti-commutative. From this 
it is found that M, M', F behave under (c) in exactly the same 
way as they do under (b), i.e. they are multiplied by the signs 
_ 4. respectively. Hence past and future play essentially 

different rdles in the quantized field equations ; we find no sub- 
stitution which leaves these equations unchanged while reversing 
the direction of time. It seems to me that we have thereby 
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reached an extraordinarily important goal of physics. We 
can now obtain the substitution 

/«-*-/« (* = 0,1, 2, 3) 1 

tl 'A* -* #3. ^3 -> ^2. 

J 

on combining (a), (b) and (^) ; this substitution neither affects 
the co-ordinates nor disturbs the quantized wave equations. 
In view of Dirac’s theory of the proton this means that positive 
and negative electricity have essentially the same properties 
in the sense that the laws governing them are invariant under 
a certain substitution which interchanges the quantum numbers 
of the electrons with those of the protons. The dissimilarity 
of the two kinds of electricity thus seems to hide a secret of 
Nature which lies yet deeper than the dissimilarity of past and 
future. 

§ 13. The Energy and Momentum Laws of Quantum 
Physics. Relativistic Invariance 

In quantizing the wave equations the spatial and temporal 
variables were treated so differently that the relativistic in- 
variance of the resulting laws might seem to be open to serious 
doubt. • But a thorough investigation due to Heisenberg and 
Pauli reassures us on this point. 25 We carry through these 
considerations on our action principle — but in such a way that 
the general validity of the argument may be readily seen. At 
the same time this offers an opportunity to discuss the meaning 
of the quantization more thoroughly than we have done hitherto. 

L The Energy and Momentum Laws of Quantum Physics . 

We begin with the 4 + 3 + 3 operators f Pt E p which 
are functions in 3- dimensional space satisfying the commutation 
rules (12.4) and the supplementary rules there set forth. There 
exists one, and in the sense of equivalence only one, irreducible 
solution of these conditions. From it we obtain the energy 
density $ defined by (6.5), (6.6) and integrate it over all of 
space : 

Jo=\tUV. 

We next construct the “ commutator ’’ 

$0=[?o, 0]+(? o 0-07o) 

% 


(13.1) 
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of an arbitrary operator with J 0 . Consider the result of this 
for the particular operators = if* p , E v ; it should be possible 
to evaluate these commutators using (12.4) and the supplement- 
ary rules alone ; if one of the quantities involved appears as a 
derivative with respect to a spatial co-ordinate it should be 
transformed by integrating (13.1) by parts — or by deducing 
commutation rules for it from (12.4) in terms of appropriately 

defined derivates of the 8 function. If is that process 

involving only differentiations with respect to the spatial vari- 
ables, but which coincide with the derivative with respect to 
time in virtue of the Maxwell-Dirac field equations, we find 





ocE p j 

8E, = ^ (13.2) 


We now drop the normalization f 0 = 0. It follows from these 
equations that 8$ for any gauge invariant operator <t> coincides 
with its time derivative as defined in terms of its spatial deriv- 
atives by means of the field laws. We may therefore replace 
the Maxwell-Dirac field equations by the quantum mechanical 
dynamical law 


1*8 
i Ax o 


?o8 


(13.3) 


g represents the probability state of the physical system (pure 
state !) at the time x Q ; it is a vector of that vector-space in which 
our operations take place. The fundamental concepts here 
involved are contained in the general programme of quantum 
mechanics as set forth in II, § 7. The “ density of electricity 
at the point P ” is, for example, represented by the operator 
p = + + + which is independent of time. The changes 

in the probability distribution for this physical quantity in 
course of time are due to the changes in the state g and not to 
changes in p itself ; the rule for the calculation of this probability 
distribution from p and g is given in the general programme 
referred to above. The same remarks apply to any gauge 
invariant quantity $. However, it is more desirable to con- 
sider the “ density of electricity ” (without specifying either 
time or position) as a fixed physical quantity represented by a 
definite operator p, and to ascribe the variations in its prob- 
ability distribution in time and space to changes in the prob- 
ability state g considered as a function of the spatial co-ordinates 
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x lt x t , x 3 in addition to the time x 0 . We should then expect to 
find four equations 

(a = 0, 1, 2, 3) (13.4) 

in place of the one (13.3) in which the operators 

tdV 

are those representing energy and momentum. Only now that 
we have formulated the general scheme of quantum physics 
in a manner which is symmetric with respect to the spatial 
and temporal co-ordinates, as required by the theory of relativity, 
can we consider it as complete. In order to determine the 
mean value of a quantity such as the electric density p we must 
assign to the spatial co-ordinates x lf x 2) x 3 , on which the operator 
p depends, any definite values (e.g. 0). The spatial com- 
ponents of equation (13.4) tell us that the replacement of ( x £) 
by a neighbouring point ( x £ -f dx v ) amounts to the same thing 
as subjecting the normal co-ordinate system in system space, 
to which the vectors j are referred, to the infinitesimal rotation 

dxi -j- J 2 dx 2 -f J z dx 3). 

♦ We must not forget that the equation (13.3) is not equivalent 
to the complete set of field equations, for we have omitted the 
one 

a(P) m div 6 -f p = 0 

which does not involve differentiation with respect to time. We 
must therefore restrict ourselves to vectors $ which satisfy all 
the equations 

*{P) i = 0. (13.5) 

These equations define a linear sub-space 9l<r of the original 
system-space Sft. The operators cr(P ), <r(P') associated with any 
two points P, P f of space are commutative : 

a(P) a(P') - a(P') a(P) = 0. 

It is of prime importance that a(P) commute with J 0) i.e, that 
Sa - j(y, a - oj 0 ) = 0 ; 

If 

that this is the case follows from the fact that the equation 

J)(T 

— = 0 is a consequence of the remaining field equations in 
the classical field theory, and consequently — independently of 
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our field equations — we may conclude that the gauge invariant 
operator a satisfies the equation 8 cr = 0. This commutativity 
of a(P) and guarantees that the infinitesimal rotation tj^dx^ 
of system-space during the time interval dx 0 does not carry the 
vector g lying in the sub-space 3L out of 9L-. 

Continuing our programme, we now set 

J ,= \t\dV 

and investigate the “ commutator ” 

80 = [J,, 0] 

of an operator 0 with J x ; we shall denote this commutator by 
S 1 whenever confusion might arise between it and the commutator 
8 == S 0 with jf 0 . We find the equations * 



(13.6) 

From this it follows that for any gauge invariant quantity 0 

^ 0 

we have 80 = — on taking the equation a = 0 into account. 

Hence the way in which gauge invariant quantities depend on 
the spatial co-ordinates can in fact be described as we predicted : 
the operators representing them are constant, but the vector 
l representing the probability state varies in space in accordance 
with the equations (13.4) for a = 1, 2, 3. 

That the four equations (13.4) are consistent also follows 
from these considerations. In the first place we have 

8^=0 or 0 

in the entire space SR ; this follows from (13.6). In the classical 
field theory the differential conservation theorem 



is a consequence of the field equations. Since t\ is a gauge 
invariant, it follows that after the quantization the operators 
satisfy the relation 



* In contrast with (6.2) we now employ the letter without the factor 
1 /a, as an abbreviation for curl f. 
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in the space SR,, defined by (13.5). Integrating over the space 
x 0 = const, we obtain 

S 0 \t\dV = 0 or ft ft - ft ft = 0. (13.7) 

[The equation which takes the place of (13.7) for the entire 
space SR is 

ftft-ftft = f cE x dV.] 

Furthermore, 


in SR,,, and on integrating this over space we find 

s ij&V = 0 or ftft - = o. 

We thus see that the operators ft are commutative in SR,, and 
consequently equations (13.4) possess one and only one solution 
i when the initial value of g (i.e. at the origin of the space-time 
co-ordinate system) is a given vector in SR,. 

II. Relativistic Invariance. 

On transforming from the normal co-ordinate system x x in 
space-time to another x' a by means of a Lorentz transformation 

3 

A: x a = Zo^Xf, 

( 5-0 

the solution of the equations 

is, as we shall show, obtained from the solution of (13.4) by 
means of a unitary transformation U induced in system-space 
by A. That is, there exists a unitary transformation U such 
that 

-AUi) = {EJJAm) 

1 a 

is satisfied in virtue of (13.4) : 

u • ZJ x dx a = ZJ dx'f u 

at . fi 

or 

ft • U. (13.8) 
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We could also say that (13.4') have the same solution g as (13.4) 
but that the normal co-ordinate system employed in system- 
space has undergone the unitary rotation U, for the vector [7g 
has the same components with respect to the new co-ordinate 
system as g had with respect to the old. We are only able to 
give the transformation U explicitly for infinitesimal A : 

IM = i + !ISM; u=i + -.m. 

I 

The equations (13.8) which are to be verified are then 
0 

In particular, the operators in system-space which correspond 
to infinitesimal rotations in physical space are, as we have 
long known, those representing moment of momentum ; that 
8 M corresponding to the infinitesimal rotation D x : 

8x 0 = 0, 8% = 0, S* 2 = — x 3 , Sx 3 = x 2 (13.9) 

about the # r axis is the ^-component of moment of momentum : 

( M 1 = )M 23 = { (x 2 t ° 3 - x s 4)dV. (13.10) 

The infinitesimal Lorentz transformations which actually repre- 
sent a re-partitioning of the world into a new space and a new 
time are dealt with in exactly the same manner ; it will suffice 
to consider as typical of such transformations 

8* 0 == x h 8x x = x 0 , Sx 2 = 0, S* 3 = 0. 

The 8 M associated with this transformation is 


M 10 = j Xl t° 0 dV + $*„ ndV; 

the second term, which vanishes for x 0 = 0, can be omitted, 
for we have already shown that commutes with all This 
term does not fit into the present scheme, in which all the 
operators are functions of x X) * 2 , alone. Our problem is thus 
reduced to showing that in 9t<r 


[M *3, yj = o, 0, 


? 3 > 

0 , 



for a = 0, 1, 2, 3. 


(13.11) 

(13.12) 


Furthermore, the invariance of equations (13.5) which define 
the sub-space will be proved by showing that the equations 

[M 23 , cr] = 0, [M m a] = 0 (13.13) 


hold in the entire space 91. 
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In order to prove (13.11) we make use of the identities 

[ ? . ( . ** = 0 [« = 1, 2, 3], 

Introducing the Kronecker 8, ft , the integrand may be written 

In consequence of ar = 0 and since t — t \ , t\ are gauge invariants 
the operations 

may be replaced by Sj = [J^ /], 

OXot 

whence 

(S* 2 % - 8*3 ?,) + 8* J (*, $ - * 3 t°)dV = 0 
or 

8A = [&, A 3 ] = S«3 % - S«2 ? 3 [*=1,2, 3], 

In the classical field theory the conservation law 

d(#2 ^3 #3 Q 

a *= 0 ~~ 

is a consequence of the field equations, whence on quantizing 

8,(*,4 - x 3 t °) + = 0 

holds identically in SR*. Integrating over the whole of physical 
space we obtain 

8 0 M 2 z — [Joj -^23] = 0 ; 
equations (13.11), i.e. 

yj = S, 2 y 3 - 8.3?, [« = 0, 1, 2, 3], 
are thus completely verified. 

The relations (13.12) are obtained in an analogous manner 
from 

= ° [ for « = 1. 2, 3] 

and from the equation 

f{s 0 (*i 0 + tl}dv = 0 
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which parallels the conservation theorem 


p(*i ® 




dV = 0 


of the classical field theory. 

We should expect the operator functions expressed by the 
\j} p) f V) E v , depending on the spatial co-ordinates, to be in- 
variant if we associate with an infinitesimal rotation of the 
spatial co-ordinate system an appropriate linear transformation 
of the components i fi p among themselves and of the vector 
components f V) E p , and at the same time subject the normal 
co-ordinate system in system space to the corresponding 
unitary transformation. In formulae : We expect the process 

80 = [M n , 0] 

to yield the equations 

w = S'</f - 

8/» = 3 % + (U- 8.3/2), 

BE V = 8'£„ + (S vi E 3 — Sj, 3 £ 2 ), 


where we have written 


S'0 = 


80 

8*3 


80 
x 3 - — . 
8#2 


But we find by direct calculation that 
8</> = 8'i/i + i(x.J 3 — x 3 f 2 )iji - 

8/l ^ #2 + #3 Hq , 8/2 — Hi , 8/3 = Hi , 

SE P = + 8„ 2 (.E 3 # 3 or) — S V3 (E 2 + ^'2 °")* 


We first observe that these equations yield 

So- = [ikf 23) a] = 0 


independently of the condition a = 0. On introducing the 
condition cr= 0 we find from these equations that gauge in- 
variant quantities <P exhibit the expected behaviour. The 
second of the equations (13.13) can be obtained by an analogous 
computation. 



272 


APPLICATIONS OF GROUP THEORY 


D. Quantum Kinematics 

§ 14. Quantum Kinematics as an Abelian Group of 

Rotations 

If we consider the operators ip, iq as infinitesimal unitary 
rotations of the ray field in system space, then Heisenberg’s 
commutation rules [II, (11.4)] assert that these rotations are 
commutative ; consequently they generate a 2/-parameter 
Abelian group, where / is the number of degrees of freedom. 
Let us therefore investigate the properties of Abelian groups 
of unitary rotations in the ray field of n-dimensional space ! 
On introducing a gauge as in III, § 16, to each such 14 rotation ” 
there corresponds a transformation of vector space with matrix 
A and between any two matrices A, B there exists an equation 
of the form 

AB = zBA. (14.1) 


This equation is possible only if e is an n th root of unity, for on 
evaluating the determinant of both sides we obtain 8" = 1. 
From (14.1) we obtain by mathematical induction 


A k B = s k BA k ,\ 
AB 1 = t l B l A , J 


(14.2) 


for k, l = 1, 2, 3, • • \ On combining these two equations by 
applying the second to A k and B instead of A and B we find 
the general rule 

A h B l = e* l B l A K (14.3) 


Taking k = n in (14.2) we are led to the equation A n B = BA n ; 
if the Abelian rotation group is irreducible Schur’s fundamental 
lemma allows us to conclude that since A n commutes with all 
elements B of the group it must be a multiple of the unit matrix : 
A n ~ 1. The order of any element of an irreducible Abelian 
rotation group in n dimensions is consequently a factor of n. 

An /-parameter continuous rotation group is generated by 
an /-dimensional linear family g of infinitesimal unitary corre- 
spondences 

°i Ci + °2 ^2 ~h * • * + oy C f (14.4) 


in terms of a basis formed by any / independent elements 
Ci) C 2l *) C f of the family. The numerical parameters 
a i» * * % oy may assume all real values. Setting a, = a t dr 
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and reiterating the infinitesimal transformation (14.4), we find 
that at “ time ” r the resulting transformation is 

U{cr lf & 2 ? ’ * *, 07 ) = + + + (14.5) 

where we have replaced by 07 . U runs through the entire 
group, which is now expressed in terms of the parameters a. 
If the group of unitary transformations of the vector space is 
Abelian the C v must satisfy the conditions 

CJC 9 - C V C. = 0 . (14.6) 

From this it then follows that all the elements (14.5) of the 
group are mutually commutative, for if AB — BA = 0 we have, 
as in the domain of ordinary numbers, 

e A • e B = e A+B . 

The parameters a in (14.5) are added on composition : 

£7(a„ ■ • o f )U(<r' t , • • •, °y) = + o'i, • • •, of + a'f). 

If, however, only the rotations of the ray space are commutat- 
ive, we find in place of (14.6) conditions of the form 


where the constitute an anti-symmetric system of real numbers. 
The commutator of the infinitesimal transformations with 
matrices 

A = o-j C x + * • • + B — r 1 C 1 + * ’ ’ + r fCf 

is 

AB — BA == iEc^o^ • 1. 

v 


We shall refer to the anti-symmetric form 

2Voyr„ = h(a } r) 

n,v 

as the commutator form ; it is invariant under change of basis. 

On writing 1 + — , 1 + - in (14.3) in place of A, B and allowing 

k = l = co , we find that the commutator of any two 
elements U(cr lt o t , • • •, of) — U(o) and U(t) of the group is 

£7 (<r) U(r) U~ x {a) U~\r) = e[h{o, t)] • 1. (14.7) 

If the rotation group is irreducible a fixed 77(a) can only 
commute with all U( t) if it is a multiple of the unit matrix, 
i.e. if all its parameters a vanish. From this we conclude that 
the commutator form is non-degenerate, i.e. that it cannot 
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vanish identically in r t for a fixed set of values cr iy unless all 
cr t = 0 — this amounts to the same as the condition |c a j =f= 0. 
Such a form exists only if the number / of variables is even, in 
which case it can, by appropriate choice of the basis (i.e. by 
transforming the variables a { and r t cogrediently under an 
appropriate transformation), be reduced to the canonical form 
in which the matrix \\c tk \\ is decomposed into 2-rowed sub-matrices 

0 1 I 

-1 0 I 

arranged along the principal diagonal.* It is then desirable to 
write 2 f in place of / and to denote the “ canonical basis ” so 
obtained by 

iP V) iQ v (v = 1 , 2 , • • *, /) 

and the corresponding parameters by a Vi r v . The factor i has 
been introduced in order to express the results in terms of 
Hermitian operators P v , Q v . The basic elements then satisfy 
the commutation rules 

i(P v Q, - QvP v ) = 1 , i(PM v - Q V P») = 0 

for /J, 4= v and 

PtxPv — PyPp = 0, QnQ v — Q v Qn = 0 
for all /a, v. The elements 

U(cr) = e(a x P x + o 2 P 2 <r f P f ) [e(x) = e ix \ 

then constitute an /-parameter Abelian group of unitary (vector) 
correspondences, as do also the 

V(t) = + r 2 0 2 4- . . . -f- TfQf), 

But the commutator of elements C/(a), V(t) belonging to these 
two sets, respectively, is 

U(o)V(t)U-'(*)V-'(t) = + . . . + G/ r f ) . 1. 

We have now carried our development to a point where we 
can profitably return to the considerations of II, § 11. In 
the case of a system with one degree of freedom in classical 
mechanics any physical quantity associated with the system 
is expressed mathematically as a function f{p ) q) of the canonical 
variables p, q. In making the transition to quantum mechanics 
we had previously restricted ourselves to polynomials in p q . 
But the Fourier representation 

+00 

9) = J J e [°P + Tq) £(o } r) da dr 

—00 

* See Appendix 3. 


( 14 . 8 ) 
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of a function / is applicable to a much larger class of functions ; 
this integral need not be interpreted literally, the essential 
point being that it represents a linear combination of the simple 
functions e(ap + r q). On considering ip, iq as infinitesimal 
unitary correspondences in ray space which are commutative 
in accordance with the relation 

i{pq - qp) = 1, (14.9) 

e(op + rq) runs through the group generated by them. If we 
now consider £(<r, r) as the components of an element in the 
resulting group algebra, then (14.8) is its group matrix in the 
representation obtained by associating with ( a , r) the unitary 
transformation e(ap + rq). This group matrix is Hermitian if 
the element is real, i.e. if 

£(<*, T ) = £(— or, — t). 

A quantity / is consequently carried over from classical to 
quantum mechanics in accordance with the rule : replace p and 
q in the Fourier development (14.8) of f by the Hermitian operators 
representing them in quantum mechanics. In particular, the 

derivatives of f are represented by 
+ 00 

fv = tj \ e { a P + rq) ■ a£(a, r) da dr, 

— 00 
+00 

/« = *5 J e(<rp + rq) • t£(<t, t) da dr. 

— 00 

On letting U(t) in (14.7) again in infinitesimal we find, with 
the aid of the commutation rules (14.9), that 

p * e[op + rq) — e(c rp -f- rq) • p == r * e[ap + r^) ; 
q * e(ap + r q) ~ e(<r p + rq) • q = — a • e[op -f- rq). 

We therefore have in general 

*/* = ?*/-/•«. = — 

as required in order that the Hamiltonian equations 

TT dp TT 

dt - dt 

be equivalent to the quantum-theoretical equations of motion 
for the vectors of system space. 

We have thus found a very natural interpretation of quantum 
kinematics as described by the commutation rules. The kine- 
matical structure of a physical system is expressed by an irreducible 
Abelian group of unitary ray rotations in system space. The real 
elements of the algebra of this group are the physical quantities of 
the system ; the representation of the abstract group by rotations 
of system space associates with each such quantity a definite 
Hermitian form which “ represents ” it. If the group is con- 
tinuous this orocedure automatical^ leads to Heisenberg's 
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formulation ; in particular, we have seen how the pairs of 
canonical variables then result from the requirement of irre- 
ducibility, whence the number of parameters in such an irre- 
ducible Abelian group must be even}* 

If one of the canonical co-ordinates, say q } is a cyclical 
co-ordinate with period 27 r, then all quantities of the physical 
system are represented by periodic functions with period 2n, 
Consequently the only values assumed by the parameter r 
associated with q in (14.8) are multiples of 2 tt and the integral 
is to be replaced by a sum. In such a case we are no longer 
dealing with a continuous group, but with a mixed (continuous- 
discrete) group. 

Our general principle allows for the possibility that the 
Abelian rotation group is entirely discontinuous, or that it 
may even be a finite group. Thus we have discussed in III, 
§ 16, a group of order 4 and an irreducible ray representation 
33 of it in 2 dimensions. That such groups actually occur in 
Nature is shown by the fact that the group we have just men- 
tioned characterizes the kinematics of the electron "spin dis- 
cussed in § 4. It can be readily shown that 33 is the only 
irreducible representation of this group, and that it is in fact 
the only irreducible 2- dimensional group of unitary rotations in 
ray space. These results emphasize the remarkable nature of 
this simplest case. The quantization of the problem of several 
electrons discussed in § 11 also falls within our general scheme. 
In dealing with it we are interested in that Abelian group whose 
basic elements p m (a = 1, 2, • • •, 2/) are all of order 2 ; such 
a group consists of the totality of the 4 / different elements 

P'tPt • • * P7 f ( n « = 1 or 0). 

The gauge can be so chosen that the corresponding unitary 
matrices p* in the irreducible ray representation in 2f dimensions 
satisfy the equations 

pi — 1. PtP« = - Papfi (a 4= /?). (14.10) 

The kinematics of the spinning electron is described by the 
simplest case / = 1 of this representation. 

Because of these results I feel certain that the general scheme 
of quantum kinematics formulated above is correct. But the 
field of discrete groups offers many possibilities which we have 
not as yet been able to realize in Nature ; perhaps these holes 
will be filled by applications to nuclear physics. However, it 
seems more probable that the scheme of quantum kinematics 
will share the fate of the general scheme of quantum mechanics : 
to be submerged in the concrete physical laws of the only existing 
physical structure, the actual world. 
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§ 15. Derivation of the Wave Equation from the 
Commutation Rules 

We now show by actual construction that there exists but 
one irreducible ray representation (excluding the identity) of 
a 2-parameter continuous Abelian group : namely, that one 
which leads to the wave equation. 

We obtain our 2-parameter continuous group as the limiting 
case of a finite group with 2 basic elements ; our proof is rigorous 
only insofar as the validity of this limiting process is admitted. 
Let A ) B be two commutative rotations of an w-dimensionai 
Unitary space. On introducing the gauge we have an equation 
between their matrices : 

AB « eBA, (14.1) 

in which, as we know already, s is an n th root of unity. The 
system consisting of the two matrices A, B shall be irreducible. 
Let their commutator, the number s, be a primitive m th root of 
unity, i.e. e m is the lowest power of £ which is equal to 1 ; m is 
then a factor of n. The orders of the rotations A , B are also 
factors of n : A n — 1, B n — 1, so the gauge may be chosen in 
such a way that A n = 1 , B n = 1. Let B be reduced to diagonal 
form by an appropriate choice of our normal co-ordinate system ; 
the elements b { in the main diagonal are then all n th roots of 
unity. Equation (14.1) then yields the following conditions on 
the elements of A = ||a ifc ||.: 

Y “ik = s a ik . (15.1) 

We divide the indices i and the corresponding variables x £ 
into classes in accordance with the rule that i and k belong to 
the same class if the quotient bi/b k is an m th root of unity, i.e. 
a power of e. That this process really results in such a division 
into classes is shown by the fact that if bi/b k and b k jbi are powers 
of e, then b^b t is also. By (15.1) a ik = 0 if i and k belong to 
different classes ; hence the matrix A is reduced in accordance 
with the division of the indices into classes. But in view of 
the assumption that the system A, B was irreducible there can 
therefore exist but one such class. 

Having established this result, we now proceed to a finer 
division into classes : i and k shall now be considered as belonging 
to the same class if b { = b k . We arbitrarily choose as the first 
of these classes that one for which &* = b and let the second 
consist of those for which b { = eb, the third with b { = e 2 fr, • • *, 
the m ih with b { = ; this exhausts the set, for the (m + l) 8t 
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class bi = s m b coincides with the first. Let the variables be 
arranged and numbered in this order. It then follows from equa- 
tion (15.1) that all sub-matrices (i, k) of the matrix A are empty, 
i.e. a ik = 0, unless their row index i and their column index 
k belong to successive classes. The matrix A then has the 
form indicated in Fig. 3, in which all elements in the non- 
shaded portions are zero (and we have taken m = 4). The 
shaded portions are occupied by the sub-matrices A (1 \ A&), 
• • *, A< m l Since A is unitary the sum of the squares of the 
absolute values of the elements of a row or column is 1 ; the 








vv\\\v\ 





A (3) ' 

' A s 






Fig. 3. 


same must therefore also hold for the rows and columns of 
each of the sub-matrices. The sum of the absolute values of 
the squares of all elements in A W must then be equal, on the 
one hand, to the number of rows and, on the other, to the number 
of columns ; the rectangle AW is consequently a square, and 
the number of indices in the second class is equal to the number 
in the first class, say d. By the same argument we see that 
the number of individuals in each of the m classes is d, and hence 
n = md. The figure is to be corrected accordingly ; each of 
the shaded matrices is now unitary. On subjecting the variables 
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of the first class to the unitary transformation with matrix 
A the sub-matrix A M is reduced to the ^-dimensional unit 
matrix. This normal form is undisturbed by a unitary trans- 
formation affecting the variables of the first set and the variables 
of the second set alike ; we can therefore reduce the second 
sub-matrix to a multiple of the d - dimensional unit matrix, and 
so on through the (m — l) st . The normal form so obtained is 
unchanged on subjecting the variables of each class to the same 
^-dimensional unitary transformation ; we may therefore choose 
as this last transformation one which reduces AW to diagonal 
form. But the matrix A is then decomposed into d-sub-matrices, 
as can be seen by renumbering the variables, taking first the 
first members in each set, then the second, etc. The irreduci- 
bility assumption then tells us that there can be but one member 
in each set : d = 1, n = m. Our matrices are now in the normal 
form : 



0 1 


£ r 


0 X 


e r+ 1 

A = 

0 1 

, B = 

e r + 2 


a 0 0 0 • • • 0 


gn+-r— 1 


all elements not explicitly indicated are zero. The exponents 
in B are n successive integers and s is a primitive n th root of 
unity. Finally, the equation A n = 1 yields a = 1. We number 
the variables from r on and take indices which are congruent 
mod. n as equal ; the two correspondences are then 

A: x k = x M , B : x k = e k x k . 

On reiteration we find 

A ' : 4 = ***, B i : x' = (15.2) 

The transition to continuous groups is now accomplished by 
passing to the limit n -> oo. Let the basis iP ) iQ of the con- 
tinuous 2-paramcter Abelian rotation group be normalized in 
accordance with (14.9). We identify the matrix A of the above 
considerations with the infinitesimal e{i;P) and B with e[r\Q) 
where £ and 77 are real infinitesimal constants. Then e(aP) = 
A s , e(rQ) = B f when in the limit si; -> a, trj->r. s is now 
e((rj) and e ki = e(£kr). e(rQ) represents the physical quantity 
e T ? ; the values which it may assume are given by where 
t is real and k runs through all integral values. In other words, 
the quantity q may assume the values kij ; q may assume all 
real numbers from — 00 to -fi* 00 . (Of course k is to be con- 
sidered mod.n and k £ mod .n£, but nt ; is a multiple of 2*77/17 
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and may consequently be infinite in the limit.) We therefore 
write q in place of where q is understood to be a variable 
which runs through the possible values of the physical quantity q , 
and Vi * $(q) in place of x k . if/(q) is an arbitrary function, 
whose values are complex numbers, which satisfies the normalizing 
condition 

l\t(q)\>dq=L 

On passing to the limit in the second equation of (15,2) we 
find that the quantity e %X( t is represented by the linear operator 

0 (?) eiTq • 0 (?)- 

Similarly we find from the first equation of (15.2) that 

iW?) */*(? + *) 

is the operator representing e i<r P. On returning from finite to 
infinitesimal unitary transformations we find 

9 «•*(*), pitKq)-\%- (15.3) 

We have thus finally justified the assumption from which we 
started in Chapter II. 

The extension of these results to systems with several degrees 
of freedom causes no trouble. The kinematics of a system which 
is expressed by a continuous Abelian group of rotations is conse- 
quently determined uniquely by the number f of degrees of freedom . 
The postulate of irreducibility allows us to conclude that the 
particular operators (15.3) of the SchrOdinger theory are a 
necessary consequence of Heisenberg’s commutation rules. 27 

P, Jordan and E . Wigner 28 have given a very elegant group- 
theoretic proof that there exists but one irreducible matrix 
solution of equations (14.10), i.e. that one of degree 2f there 
mentioned and given in greater detail at the end of §11. 



CHAPTER V 


THE SYMMETRIC PERMUTATION GROUP AND THE 
ALGEBRA OF SYMMETRIC TRANSFORMATIONS 

A. General Theory 

§ 1. The Group Induced in Tensor Space and the 
Algebra of Symmetric Transformations 

r HE principal problem we propose to solve in this chapter 
is the group -theoretic classification of line spectra of an atom 
consisting of an arbitrary number , say /, of electrons , 
taking into account the reduction of the space ffi to as re- 

quired by the Pauli exclusion principle , and the spinning electron. 
For this it is necessary to consider in detail the representations 
of the symmetric group, i.e. the group 7 77 of all /! permutations of 
/ things. These are most intimately related to the representa- 
tions of the group U of all unitary transformations or the group 
c of all homogeneous linear transformations of a space $R n . 
This connection has already been touched upon in Chapter III, 
§ 5 : the substratum of a representation of c or it consists of the 
linear manifold of all tensors of order / in 9t n which satisfy 
certain symmetry conditions, and the symmetry properties of 
a tensor are expressed by linear relations between it and the 
tensors obtained from it by the/! permutations. 

A tensor F of order /in the n-dimensional vector space SI = 
is defined by its n-f components or, as we prefer to say, “ co- 
efficients ” F(i x i 2 • • * i f ) ; each of the indices i runs from 1 to n . 
Tensors can be added and multiplied by arbitrary numbers ; 
hence the totality of such tensors F constitute a linear “ vector 
space ” ffl of nJ dimensions. Further, F can be subjected to 
an arbitrary permutation s of its / indices, which can be thought 
of as a permutation of the / numbers 1, 2, * • *, / attached to 
the indices i in the general component above ; if s is the per- 
mutation 


1 - 1 ', 2 -> 2 ', • • •,/->/' 
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then the tensor sF obtained by applying s to F is, by definition, 
that tensor whose coefficients are 

sF(i x i 2 • • ■ i f ) = F(i v i w • • • i f ). (1.1) 

It follows from this definition that for any two permutations 
s and t 

t(sF) = (ts)F. 

A linear correspondence F F' : 

F'(h • • • if) — Sa{i x i, \ K ■■ ■ k f )F{k 1 • • • k f ) (1.2) 

(A) 

is said to be symmetric if the coefficient 

«(*i •••*/; K •■•*/) 

is unaltered on subjecting the sub-indices 1, 2, • • *, / of both the 
indices i and k to the same arbitrary permutation s. The pro- 
cesses of addition, multiplication by a number and permutation, 
in the sense defined above, applied to tensors are invariant 
under symmetric linear transformations ; and conversely, any 
transformation of tensor space under which these processes 
are invariant is linear and symmetric. The totality of symmetric 
correspondences constitutes an algebra 2 : \i A and B are ele- 
ments of 2 then A + B, AB and cA ( c an arbitrary number) 
are also. The problem with which we shall concern ourselves 
is the reduction of ffi into linear sub-spaces *p which are in- 
variant with respect to 2, i.e. with respect to all symmetric linear 
transformations. Wherever in the following we employ the 
terms invariant, irreducible, etc., in referring to the tensor space 
ffl, they are to be interpreted with respect to the algebra 2. 

We give a brief r6$um6 of our terminology. We are dealing 
with a vector space SR and a system 2 of linear correspondences 

s-* = 

of 91 on itself ; we may often prefer to use the term 44 linear 
projection ” instead of 44 linear correspondence (operator) ” in 
order to bring out the fact that the correspondence need not 
be one-to-one. A (linear) sub-space of 9i is invariant if an 
arbitrary projection A of the system 2 sends every vector 
j of over into a vector of is irreducible if it contains 

no invariant sub-space other than itself and the space 0 con- 
sisting only of the vector 0. We shall always understand by 
a complete reduction $ = of the invariant sub-space 

a complete reduction into two linearly independent invariant 
sub-spaces ^ 2) even when this is not explicitly stated. A 
linear projection j $' of the invariant sub-space 5J5 on the 
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invariant sub-space is similar if two vectors £ and tj of ^ 
which are related by a correspondence A of the system ; 
are always projected into two vectors j' and t)' of which are 
related by the same A: k)' == A ^ and are similar or 

equivalent : ^ $ if a one-to-one linear and similar corre- 
spondence can be set up between and In particular, 

these concepts are to be applied to the case in which the vector 
space is the tensor space ffi = of vf dimensions and Z is 
the totality of symmetric transformations. 

In quantum theory the state of a system consisting of / 
equivalent individuals (electrons) with a system-space is 
described by a tensor of order / in The energy necessarily 
depends on each of the / individuals in exactly the same way ; 
hence the Hermitian operator which represents the energy is 
necessarily symmetric in our sense. The fundamental dynamical 
law therefore allows us to conclude that an invariant sub-space 
of ffl has the property that if the tensor describing the state 
of the system is at any time in no influence whatever can drive 
it out. A complete reduction of ffi into invariant sub-spaces 
implies a corresponding reduction of the operator representing 
the energy ; hence the term spectrum is reduced into classes 
of terms belonging to the various such that the members of 
one class can under no conditions combine with the members 
of another. Naturally this division into non-combining classes 
is to be carried as far as possible. But this problem is exactly 
the one proposed above — the only difference being that we are 
here only concerned with the totality Z (h) of symmetric Hermitian 
operators. However, this restriction is quite irrelevant, for 
any symmetric operator can be written in the form A — A x + iA 2 
where 

A, = \{A + A), A 2 =±(A-A), 

are both Hermitian. 

On going over to a new co-ordinate system in the fundamental 
vector space by means of a non-singular transformation 

%i = Z a(ik)x k (1.3) 

the coefficients of a tensor F are transformed in accordance with 

F'ihh * * * h) = * • * alifkf) * F(k x k 2 •••&/) 

(*) 

(1.4) 

The transformation (1.3) in vector space induces the symmetric 
transformation (1,4) in tensor space. These induced trans- 
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formations, which wc shall call u special symmetric transforma * 
lions” constitute a group E 0 which is isomorphic with the com- 
plete linear group C » c n ; this representation of c was previously 
denoted by (c)A The group E 0 is contained in the algebra E. 

1 tenc'o a sub-space of ffi which is invariant under the algebra 
£ is a fortiori invariant under the group Z 0 . That the converse 
of this result is also valid is not so self-evident. Nevertheless 
for all questions involving only linearity E 0 can be replaced by 
the more extended 2 } for E is what we might call an enveloping 
algebra for the group E 0 ; by this we mean that any symmetric 
transformations can be expressed as a linear combination of 
appropriately chosen special symmetric transformations. 1 To 
show this we prove the theorem : 
el homogeneous linear relation 

£ •••*/; k !*■•*/) *(h •••»/; *1 •••*/) — o (1.8) 

0 ; *> 

is satisfied identically by all symmetric transformations 


M*» ' * * V ; K ' ' ' k f )\ t, 


if it is satisfied by all special symmetric transformations, i.e. if 
t lit* equation 

•••*/; ki ■ - • k f )x(ijk i) * • • *(t A) = o (1.6) 

«;*> 


is satisfied for all values of the n a variables #(i&) for which 
the determinant | 4= 0. 

Proof. Denoting the pair ( ik ) of indices by j and calling the 
«* m values of j simply 1, 2, • • •, tn, the left-hand side of 
(1,6) is a homogeneous polynomial of order / in the m variables 
x{ik) x , : 

<f>[X\X X * • * X m ) Sb[fi, ft, ' ’ fm)X?\ x 2 ‘ ‘ X m 

(/) 

fl 

where /j I /* f h / m /and b(f h / 2 , • % ,f m ) 15 JfJfl . . .f m \ 

times that eoeilicient <•(;,;, • • • j,) whose indices contain j — 1 
f times, j 2/, times, etc. On denoting that variable x{jjt ••■]/) 

m which the indices; «« 1, 2, • • *, m occur f x , f t ••*,/« times by 
/ . . . / ) the left-hand side of equation (1.5) becomes 

muf* * ■ *» * • '.u 

(/) 


The determinant of the x(ik) is a certain polynomial D{x \x x • • • #«*) 
in the variables Our assertion is thus reduced to the well- 
known theorem of algebra: let <f>(x), D(x) be two polynomial 
in the variables x t *, * • * #*, the second of which does not vanish 
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algebraically, i.e. its coefficients do not all vanish. If <j>(x) is 
zero for all values of the variables for which the value of 
D(x) =# 0, then <f>(x) vanishes algebraically. 

This theorem is proved for a single variable x as follows. 
If <f>(x) does not vanish algebraically it has a definite degree 
p^ 0; let q be the degree of D(x). There are then at most 
p + q values of the variable x for which <f>(x) or D(x) vanish ; 
for any one of the remaining infinitude of possible values of 
x neither <f>(x) nor D(x) can vanish, contrary to assumption. 
The theorem is readily extended to polynomials in any number 
of variables by mathematical induction. The principal point 
is that the analytical vanishing of a polynomial for all values of 
the independent variables implies that it vanishes algebraically. 

In quantum theory the vector space 91 is unitary : the transi- 
tion from one normal co-ordinate system to another such is 
accomplished by an arbitrary unitary transformation (1.3). 
The transformations thus induced constitute a sub-group 2^ M) 
of S Q which is isomorphic to the unitary group u n , i.e. the 
representation (u Y of the unitary group. I assert that a sub- 
space of ffl which is invariant and irreducible with respect 
to 2 remains irreducible not only under the group 2 0} but under 
the more restricted group 2^ u) as well. To prove this we must 
show that the identity (1.5) holds even when we assume only 
that (1.6) is true for those values of the variables x(ik) with 
unitary matrix. 

One of the most natural proofs of the above theorem con- 
cerning the formal vanishing of a form <f> of order / depends on 
the process of “ polarization ” : we assign arbitrary infinitesimal 
increments dXj to the values of the variables x j ; the identical 
vanishing of <j) then allows us to conclude that the differential 


y H 

7 ix, 


dxj 


vanishes for arbitrary values of Xj and dxj. This procedure 
also leads us to the desired conclusion in the case under con- 
sideration. Denoting by & the matrix obtained by transposing 


rows and columns in 


we have 


'dx(ik) 

tr (0dX) 


where X , X + dX are two arbitrary neighbouring unitary 
matrices. In order that this be the case we must have 


dX = iX ■ 8X 
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where 8X is an arbitrary Hermitian matrix : the “ rotation ” 
X + dX is obtained by following up the rotation X with the 
infinitesimal rotation 1 + i • SX. But the equation 

tr (<PX -8X) = Q 

implies the vanishing of <PX. This is seen immediately from 
the fact that a linear form 

Jc Vik 

in the variables y ik = S x(ik) vanishes identically if it vanishes 
for all values satisfying the condition y ki = y ik ; indeed, any 
matrix Y = ||y ifc || can be written in the form Y x + i Y 2 where Y x 
and Y 2 are Hermitian. On multiplying the right-hand side 

of @X = 0 by X 1 we find $ = 0: all derivatives 

vanish in the same sense as <f> itself, i.e. for arbitrary x{ik) whose 
matrix is unitary. But these derivatives are forms of order 
/— I ; the truth of our assertion above is thus proved by 
mathematical induction. 

Every invariant sub-space $ of ffl is the representation 
space of representations of the groups c and u which are con- 
tained in (c )/ and (u)^ respectively. Hence the above results 
prove that if is irreducible these representations are also. 

§ 2. Symmetry Classes of Tensors 

One of the most natural methods of obtaining invariant 
manifolds of tensors F consists in subjecting F to linear symmetry 
conditions of the form 

£a(s) • sF = 0. (2.1) 

9 

This suggests introducing the symmetry operator 

a = £a(s) • s. (2.2) 

9 

Such operators can be added and multiplied with arbitrary 
numbers, and two operators a ) b can be applied successively 
with the same result as the symmetry operator c — ba defined by 

c(s) = Zb{t)a[t'). (2.3) 

W *= 9 

In other words, we are here led in a most natural way to the 
algebra p of the symmetric group tt = 7r f of all permutations 5. 
The elements of this algebra, which constitute an /I -dimensional 
linear space t, appear as operators which can be applied to 
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tensors of order /. We may call the numbers a(s) appearing 
in (2.2) the components of the element a. In particular, a is 
an Hermitian operator in the tensor space ffi if it is a real 
element, i.e. if it coincides with its Hermitian conjugate a 
defined by the equation 

a(s) = d(s~ x ). (2.4) 

Hence these real symmetry operators represent physical quan- 
tities of the physical system consisting of /equivalent individuals, 
whose total system space is ffl ; quantities of this kind are 
unknown in classical physics and cannot be pictured in terms 
of the usual spatial and temporal models. 2 

(2.1) or 

£a(s)x($) = 0 

« 

is a linear condition which is imposed on the element x = F 
defined by x(s) = sF. A symmetry class is defined by one 
or more equations qf this kind ; we are thus led to the definition : 

Each linear sub- space £ of t determines a symmetry class ^ 
of tensors. F belongs to ^5 when the corresponding symmetry 
quantity or element F is in p. It will be found convenient to 
denote the process by which is generated from p by a symbol ; 
we write ^ = ftp. 

If the reader finds it difficult to operate with elements F 
whose components sF are tensors rather than numbers he may 
replace the tensor by the totality of its coefficients F(i x i 2 • • * if) 
and F by the elements 

X = F(i L i 2 • • * i f ) 

associated with each definite set of indices (i x i 2 * • ■ if) ; this x 
is defined by the equation 

x(s) = sF(i x n • • • if). 

The requirement that F belong to p means that F(i x i 2 * • • if) 
belongs to p for all the nf possible combinations of the indices i. 
That the symmetry class $ = $p is invariant with respect to 
all symmetric transformations (1.2) is due to the fact that (1.2) 
implies the corresponding equation for the elements F, F'. 
F f (iji 2 • * * if) is a linear combination of the elements F{k x k 2 • • • kf) 
associated with the various combinations (kfk 2 • • * kf) of indices k. 

If F belongs to p then a • F does also, where a is any element 
whatever of the algebra. To show this we note that the 
^-component of 

H(h ’ - ' h) = a ' P (h • • ' h) 
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is given by 

2>(r“ l ) * rsF(i x ■ * • v) = iX*" 1 ) ‘ *^(*i ' ' ' k t) 

r r 

where the • • -, k f are obtained from i ly • * % if by the per- 
mutation r. Hence H (t'i, • • *, i/) is a linear combination of those 
JP(Jki * • • k f ) whose indices k are obtained by a permutation of 
the indices i. 

The principal question now is whether every invariant 
sub-space 5(5 can be generated from a p by the process and 
further, whether or to what extent this generating p is uniquely 
determined by 5(5. The answer is perhaps best expressed with 
the aid of the inverse process \ which generates a p from the 
given 5(5. The following geometrical analogy may be useful 
in enabling the reader to understand the situation with which 
we are dealing. Let the points x of a plane with a fixed centre 
correspond to the elements of the algebra p and the line segments 
F going out from the origin correspond to the tensors. On 
contracting the entire plane, leaving the centre invariant, in 
the fixed ratio r (0 ^ r g 1) the point x goes into the point 
tx and the segment F into the segment tF ; this contraction 
of segments shall, be the analogue of the symmetrical trans- 
formations of tensors. 5(5 will now denote an “ invariant ” 
set of segments, i.e. a set such that if it contains the segment F 
it also contains all the contracted segments tF. Just as we 
associated the symmetry elements P[i t • • ■ i f ) with the tensor F 
we now associate with the segment F the continuum of points 
F(r) of F ; P(r) is the end point of the segment tF. Let p be 
any set of points ; the segment F will then be included in the set 
5(5 = |p if and only if all its points P(r) are in p. Obviously the 
only segment sets 5(5 which can be obtained in this way are 
those which are invariant, and all such invariant sets can be 
so obtained. Only the “ core ” p 0 of the point set p is essential 
to this construction ; p 0 consists only of those points x such 
that rx belongs to p for all r (in the interval 0 ^ r ^ 1). p 0 
is invariant in the sense that with x all rx belong to p 0 . That 
only the core p 0 is essential means that our construction generates 
the same segment set 5(5 from two point sets p, p' if these latter 
have the same core ; hence we can restrict ourselves ab initio 
to the consideration of invariant point sets p = p € . It is extra- 
ordinarily easy to find the point set p which generates a given 
segment set 5(5 : we include in p those and only those points 
lying on the segments of 5(5, and this p is automatically invariant. 

If the reader will think through this geometrical illustration, 
which we have formulated here in such a pedantic manner, he 
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will have no trouble in understanding the analogous situation 
for tensors and symmetry elements. A linear sub-space p of X 
is to be called invariant if all elements ax are in p, where x is 
an arbitrary element of p and a is any element whatever.* 
Hence such a p is invariant under the totality of correspondences 
of the form 

(a) : x -> x' = ax (2.5) 

On associating this correspondence (a) of t on itself with the 
element a we obviously obtain a representation of the algebra p 
(and therefore of the group rr f ) ; it is called the regular 
representation, (r appears here twice: once as the repre- 
sentation space and again as the algebra p represented in this 
space ; the first will be expressed by the German letter r, the 
second by the Greek p. We are here doing the same thing as 
in III, § 2, where we obtained a realization of the group g by 
associating with the element a of g the correspondence s-+ s' = as 
of the group manifold on itself.) This regular representation 
supplies us with material from which we can construct all — 
and hence in particular the inequivalent irreducible — repre- 
sentations of the algebra p. When we use the terms invariant, 
irreducible, etc., in t they will always refer to the algebra of all 
correspondences (a) of t on itself, which is simply isomorphic 
with the algebra p of all symmetry elements a. ,() being an 
invariant sub-space of t, we shall always refer to the representa- 
tion induced in £ by the regular representation simply as the 
regular representation in £ ; it associates with each element a 
the correspondence (2.5) of p on itself. The equation x' = ax 
is, in terms of components, 

x'[s) = 2a(r-*)x(rs). 

r 

Let x be an arbitrary element of p ; the requirement that p be 
invariant allows us to conclude that the element x' defined by 
x'(s) = x(rs) is also in p, where r is any fixed permutation. 

Let p be an arbitrary sub-space of t ; we say that x belongs 
to the core p 0 of p if and only if all quantities of the form ax 
belong to p ; this p 0 is invariant. We thus have the theorem 
that two linear sub-spaces p, p ' generate the same symmetry 
class = §p = §p' of tensors if they have the same core. We 
may therefore restrict ourselves ab initio to the consideration of 
invariant sub-spaces p. 

* This " invariant sub-space " is not the same as an " invariant sub* 
algebra ” as defined in Chap. Ill, § 13 ; to conform with our previous nomen- 
clature it should be called a " left-invariant sub-algebra.” 
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It is possible that certain relations (2.1) will be satisfied by- 
all tensors. Let r 0 denote the smallest sub-space of r which 
contains the elements F(if 2 • • • i f ) associated with all tensors 
F and all values of the indices (i x i 2 • • • if). Then p generates 
the same $($ = $p as the intersection of p with t 0 ; it is therefore 
natural to restrict ourselves further to the consideration of 
invariant sub-spaces p of t„. These remarks are not applicable 
if the dimensionality n ^ /, for certainly the /! coefficients 

sF(l, 2, • • ", f) = F( V, 2', • • •,/') 

of the arbitrary tensor F are independent. But the situation 
is different in case n<f: for example, let S, = ± 1 according 
as s is an even or an odd permutation ; then 

I K'SF 


is an anti-symmetric tensor and must therefore vanish in case 
the dimensionality « is less than the order /. 

We can at most hope that conversely p is uniquely determined 
by $ if we restrict ourselves to invariant sub-spaces p which are 
contained in r 0 . In order to prove that this is indeed the case 
we attempt to find the inverse process which leads from 
to p, following the programme outlined by the geometrical 
analogy considered above. In case n ^ / this is readily done 
as follows: if F is any tensor in $ we let the element 
x = F( 1, 2, •••,/) in r correspond to it ; p consists of all the 
elements x so obtained. But in order to obtain a method which 
is also applicable to the case n<f we must alter the procedure. 
We understand by p = ^ the smallest linear manifold containing 
the totality of elements F(i 1; i 2 , • • •, i f ) associated with all possible 
tensors F of $ and all possible combinations of indices (if., •••*/). 
If the tensors E x constitute a basis for p consists of all elements 
of the form 


X = ££Ca 
* (0 


if) • B a {i\ 


V) 


( 2 . 6 ) 


That such a p is invariant has already been shown above, for 
lf x t 5 F Wt •*•*/) the element x’ defined by x’(s) = x(rs) is 
equal toF(^ • • . k,) where k x k 2 • • . k t are obtained from 
l i z i ' ' m tf by the fixed permutation r . 

We now denote the t 0 introduced above by baft/ ; it coin- 
cides with the entire space t when n ^/. Let the symbol -3 
denote is contained m ” ; the following results then follow 
r the ^ efimtl0ns : If * is a linear sub-space of 

If $ is a j?y linea r sub-space of 
9t and p — t)$, then conversely -g $p. We can at most 
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expect that the symbol -g can be replaced by = if in the first 
theorem p is an invariant sub-space of t 0 and in the second if 
$($ is an invariant sub-space of ffi . That these converse theorems 
are in fact true under these limitations will be proved in § 4. 

§ 3. Invariant Sub-spaces in Group Space 

We are in need of a fundamental theorem concerning the 
algebra of a group as a preparation for carrying through the 
investigation proposed above ; we here prove this theorem for 
a general finite group. However, we do not alter the notation, 
so here tt denotes any finite group of order h. 

Theorem (3.1). If p is an invariant sub-space of x there exists 
an element e of the group algebra having the following two prop- 
erties : (1) every element of the form xe belongs to p, (2) every 
element x of p satisfies the equation xe = x. 

In particular (1) implies that e = le itself belongs to p, 
and hence by (2) ee = e; e is idempotent . 3 It is a “general* 
ing unit 99 of p in the sense that p consists of all elements of the 
form xe . 

Proof Let e lt e 2 , • • *, e h be a co-ordinate system in the 
vector space t which is adapted to the g- dimensional sub-space 
p in such a way that p is the linear set defined by e h e 2 , • • *, e g . 
The parallel projection which transforms 

x = x x e x + • ■ • + oc h e h into x' = x 1 e l + * • • + x g e g 

has the two properties (1) it projects every x into an x' lying in 
p, and (2) within p it is the identity. In the original co-ordinate 
system defined by the simple elements 5 of the algebra this 
projection is given by 

x'{s ) = zd(s, t)x(t), 

t 

where the matrix d(s, t) is necessarily of the form 

d(s, t ) = e x (s)^(t) + • • • + e a (s)e 0 (t) 

and the e { (s) are defined by 

£ei{s)e k (s) = S,-* (i, k = 1, 2, • • •, g), 

8 

The fact that p is invariant implies that if x is in p then the 
element x r defined by # r (s) == x(rs) is also in p . Consequently 
the projection with the matrix d(rs, rt) has the same two prop- 
erties (1) and (2), where r is any fixed permutation (i.e. element 
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of the group 7 r) whatever. Hence the assertions also hold for 
the correspondence with the matrix 

e(s, t) =\zd(rs, rt) ( 3 . 2 ) 

obtained by summing over all elements r of the group. This 
matrix satisfies the equation 

e[rs, rt) = e(s, t), 

whence e{s ] t) depends only on the combination tr x s : e(s } t) = 
The linear projection 

x'(s) = Ze{s, t) x(t) 

t 

may therefore be written briefly x f = xe } which proves the 
validity of the theorem. 

Let the invariant sub-space p be completely reduced into two 
invariant sub-spaces : p = p x + p 2 , and let e be the generating 
unit of p. Any element in p can be written as the sum of 
its components in p x and p 2 ; hence in particular e = e x + e 2 . 
From this it follows that for an arbitrary element x of p 

x = xe = xe x + xe 2 . 

But since x 1 = xe x is in p x and x 2 = xe 2 is in p 2} x x and x % 
are the (unique) components of x in p x and p 2 . These two 
components for the element e x are obviously e x and 0, whence 

e x e x = e l9 e x e 2 = 0 ; 

similarly 

c 2 o x = 0, e 2 e 2 — 

Hence e Xi e 2 are the generating idempotent units of p Xf p 2 re- 
spectively; they are “independent” in the sense of the 
equations 

^ 1^2 = 0 ) ^ 2^1 = 0 * 

On completely reducing p into any number of components : 
-P = Zpi, the generating unit e of p is decomposed into 

e = Ze e 

i 

the components of which satisfy the analogous equations 
e * e * “ ® ( z * 4= k), e t e< = e im 

. existence of the generating unit offers a means of ob- 
taining a new and simpler proof of the fact that reducibility 
implies complete reducibility : 
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Theorem (3.3). If p, are invariant and p x -3 p, then p can 

be reduced into px + p 2 in such a way that p 2 is also invariant. 

Proof. Let e x be the generating unit of p x . We decompose 
every element of p in accordance with the equation , 

x= xe x + (x — xe x ). (3.4) 

The first component x x = xe x lies in p l5 and the second 

x 2 = x — xe x 

runs through a certain linear sub-space p 2 of p when x runs 
through all elements of p. This sub-space p 2 is also invariant, 
for 

ax 2 = ax — (ax)e x 

as ax is in p if x is. The elements x X) x 2 of px, p 2 respectively 
satisfy the equations 

x x e x = x X) x 2 e x = 0. 

From this it follows that the sum of an element y x of p x and an 
element x 2 of p 2 cannot vanish unless bothy-* and x 2 also vanish ; 
hence p x and p 2 are independent. To prove this we merely note 
that on multiplying y x + X 2 = 0 by e x we find y x e x — y x = 0. 
Equation (3.4) represents the reduction of any element of p 
into its components in p x and p 2 . 

Any idempotent element e generates an invariant sub-space 
p e consisting of elements of the form xe. If e 1} e 2 are two 
independent idempotent elements [e x e 2 ~ 0, e 2 e x = 0) then the 
sub-spaces p X} p 2 which they generate are independent, and the 
idempotent element e = e x + e 2 generates p = + p 2 - An 

idempotent element e is said to be primitive if it can only be 
expressed as the sum of two idempotent elements e x + e 2 if 
one of the summands is 0 (and the other e). In order that p e 
be irreducible it is necessary and sufficient that e be primitive. 

Obviously any idempotent element e, in particular the 
modulus 1 of the algebra, can be reduced into the sum of 
independent primitive idempotent elements. For if we have 
a reduction into independent non-vanishing idempotent elements 

e = ex + e 2 + * • • + e m 

and if, for example, e x is not primitive, it can be further re- 
duced to the sum of two independent non-vanishing idempotent 
elements e x ' + ; in this way we obtain a complete reduction 

of e into m + 1 independent terms, for we have, for example, 

e x e 2 = = 0 ; similarly e 2 e' x = 0. 
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This process must certainly cease after at most h steps. Our 
analysis allows us to assert that we thus obtain a complete 
reduction of p 6 into independent irreducible sub-spaces. 

We have seen that the theorem concerning the complete 
reducibility is a consequence of the existence of a generating 
unit. But the converse is also true : If £ appears as a summand 
in a complete reduction t = p + p' of our given algebra r, then 
it possesses a generating unit. We need only to specialize the 
considerations developed above by applying them to the modulus 
1 of r ; 1 can be completely reduced into the two components 
e + e' lying in p and p\ and the generating units of p and p' 
are e and e' respectively. 

The mathematician will find it worthy of note that all these 
considerations are still applicable when the algebra is defined 
over any field whatever. Instead of dealing with the continuum 
of real or complex numbers, as in analysis, we may in abstract 
algebra operate in an arbitrary field , i.e. a domain of elements, 
called numbers, in which the two fundamental operations of 
addition and multiplication and their inverses, subtraction and 
division, are defined in accordance with the formal laws of 
ordinary arithmetic. Our development depended only on these 
rules of operation — with a slight restriction. There are fields in 
which a definite integer, say h, times any number of the field 
yields zero ; we may say that h annihilates. Such 4 4 modular ” 
fields must be excluded, for we wish to retain the possibility 
of finding a number such that its product with h is any given 
number. When our reasoning involves no more restrictive 
assumptions concerning the number field, we are operating in 
a relatively elementary theoretical domain. However, such 
theorems as the 44 fundamental theorem ” III, (10.5), and that 
of Burnside-Frobenius-Schur, which depend on the fundamental 
theorem of algebra, belong to a deeper layer. These theorems 
hold only in “ algebraically closed ” number fields, in which 
any algebraic equation (with coefficients in the field) is soluble. 
Finally such concepts as 44 Hermitian,” 44 unitary,” etc., involve 
the transition from a number to its conjugate complex and 
have no place in general abstract fields. Our earlier proof of 
the theorem of complete reducibility was obtained with the 
aid of such tools foreign to the general concept of a field. 

Theorem (3.5). A similarity projection x -* x' of the invariant 
subspace p on the invariant subspace p' is necessarily expressed 
by an equation of the form x ' = xb . (In particular, when p 
and p' are equivalent this theorem is applicable to the one-to-one 
similarity correspondence p ^ £'.) 

Proof Let the given similarity correspondence send the 
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generating unit e of p over into b. In virtue of the similarity 
xe then goes over into x f = xi?, where x is any element in b * 
but for such an element xe = x. 

Additional remark The projection sends e into eb ; hence 
eb = b. ' On the other hand, if e' is the generating element of 
p' } then since b is in p’ we have be’ = h : 

h = eb = be' = ebe\ 

We express this result, i.e. that b is of the form exe by saying 
b has the character (e, e'). Our considerations show that such 
a projection can always be expressed in terms of a unique 
element b of character (e, e'). 

If we are operating in the field of complex numbers, with which the 
investigations of analysis (e.g. the theory of functions) deal and in 
which we are exclusively interested in quantum theory, we may supple- 
ment the theorem (3.1) concerning the existence of a generating unit e 
in an invariant sub-space p by the following : 

The generating unit may he so chosen that it is real ; it is then deter - 
mined uniquely by p. 

To prove this we choose as the basis e lt e 2 , . . of p a unitary- 
orthogonal system of vectors ; then 

Sei{s)e k (s) = S ik (», k, = 1, 2, . . g). 

8 

In constructing d(s, t), which we now denote by e(s , t), we may therefore 
choose * i i : 

e(s, t) -ie^e^t), (3.6) 

I assert that the equation 

e(rs, rt) = e(s , t) (3.7) 

is automatically satisfied — it is no longer necessary to take its mean 
value as in (3.2). The element e defined by etf-'s) ~ e(s, t) is then the 
real generating unit of p. 

In order to establish the validity of (3.7) it is only necessary to 
note that e(s t t) is independent of the particular unitary basis e lt e lt 
. . e Q chosen ; for on going over to a new unitary basis e', e 2 , . . ., 
e g by a unitary transformation U the bilinear form (3.6) remains in- 
variant. Now in particular the equation 

e’i(s) = «*(«). 

in which r is a fixed element of the group, defines a transition to a new 
unitary basis. 

To prove that this real generating unit e of p is unique, assume there 
exists a second, e' ; then all elements x of p satisfy the equations 

xe = x, xe' = x. 

On applying the first equation for x = e‘ and the second for x = e we 
have 

e f e — e', ee' = e. 
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But since e and e' are both real, the first of these results yields, on 
going over to the Hermitian conjugates, 

ee' = e\ 

and from this and the previous result we conclude that e f = e. 

Under these conditions the content of theorem (3.3) can be extended 
and its proof simplified. If e, e x are the real generating units of p, p x 
respectively, then since e x is in p e x e = c x , and on going over to the 
Hermitian conjugates we find ee x = c x . Hence the idempotent element 
e, introduced by e = e x + e 2 is real and independent of = Pi + ; 
is thus completely reduced into pj and an invariant sub-space p s which 
is unitary-orthogonal to pi and which has as its real generating unit e s . 

§ 4. Invariant Sub-spaces in Tensor Space 

We now return to the investigation of tensors of order /, 
the totality of which constitutes the space ffi. Let tt again be 
the group of all permutations of / things and r ( = p) the corre- 
sponding group space (algebra). Let a be a symmetry quantity, 
i.e. an element of the algebra p , with components a(s) ; the 
element & is then defined by 

d(s) = a(s-') ( 4 . 1 ) 

The relation 

F' = aF ) 

which asserts that the tensor F' is obtained from F by the 
operator a, is equivalent to the equation 

f = f • a 

between the corresponding elements F and F' of the algebra p . 
For 

sF' = Zdijr 1 ) • stF 

t 

is in fact obtained from 

F f = Za(t) • tF = Zd{r*) ■ tF 

by operating on it with the permutation s. 

In the following considerations, which are concerned with 
symmetry classes of tensors, p (with or without index) always 
denotes an invariant sub- space of t, a the generating unit of 
p and the corresponding We may then say that e is 
the generating idempotent operator of the symmetry class in 
the following sense : 

(1) eF lies in F being any tensor whatever ; 

(2) if F is in $ it is reproduced by the operator e : eF = JF, 
In this way we obtain a constructive definition of the symmetry 

class as the totality of all tensors of the form eF . This definition 
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is considerably simpler than the original one in terms of p, for 
it depends on a single element e instead of a manifold p. If, 
for example, we are dealing with the class 5J5 of all completely 
symmetric tensors 



is such an operator ; the corresponding operator for the class 
of all anti-symmetric tensors is the alternating sum 


Theorem (4.2). If p' -3 p or p = p t + p 2 , we have 5(5' -3 5(5, 
5(5 = 5J5 X + respectively. 

We need to prove only the latter part of this theorem, 
i.e. for the case of complete reduction. The generating unit 
e = e x + ^2 °f P ^ as as components e 1} e 2 in p 1? p 2 the generating 
units of p x , p 2 respectively. The formula 

eF = e x F + e 2 F 

defines the corresponding complete reduction of 5(5 into the 
independent invariant sub-spaces 5(5 2 . 

Theorem (4.3). If p x ^ p 2 then 5(5|. ~ 5(5 2 . 

The similarity correspondence x x -> x 2 of p x on p 2 is, by 
theorem (3.5), of the form 

x 2 = x x b } x 1 = x 2 b r . 

Hence 

s bF x , F x = b'F 2 

define a one-to-one similar correspondence of 5(5x on 5(5 2 an d its 
inverse. 

Theorem (4.4). If p -g r 0 /Am p = l|5(5. 

The only non-trivial part of this first converse theorem which 
remains to be proved is that p -g t|$(5. All tensors of the form 
F a = eE * are in 5(5, where (£?*) is a basis for the entire tensor 
space 9V ; hence all elements of the form 

y = EcSi * • * if) * P*(ii * ■ * if) 

<x, i 

are in [j^. On introducing 

x = £c a (i t • • • if) ■ B»(*\ • • • */) 

a, i 

we have ,y = x&. On recalling the definition of t 0 = tjSftf we 
see that xe belongs to if x lies in r 0 . But in virtue of the 
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assumption that p -g t 0 this is automatically satisfied if x is an 
arbitrary element of p ; but then xe = x . Hence every element 
x of p is contained in 

In order to formulate the converse of these theorems let 
(with or without index) now denote an arbitrary invariant 
sub-space of ffi and p the corresponding 

Theorem (4.5). If -g $ or $|3 = + $ 2 > then p' -3 p t 

p = p t + respectively . 

Theorem (4.6). If ^ then p ^ p\ 

Theorem (4.7). = $p. 

The last theorem is by far the most important of all ; it 
asserts that every is a symmetry class of tensors . It is desirable 
to prove it first, i.e. to prove that §p -3 $. Let 6 again denote 
the generating unit of p ; §p then consists of all tensors of the 
form F' = eF. Since the element & belongs to p it is necessarily 
of the form 

S[s) = e(s~ x ) = Ze a [h x • • • k f ) • s£„(fc x • • • &,), (4.8) 

a,* 

where the tensors E a constitute a basis for the space Now 
the trivial equation 

Zsc{h • • • if) • sF{i x • • • *,) = * * • */) ■ ^(*1 ' • • h) 

i i 

shows, on replacing sc by c ) that 

Zc[h • • • if) • sF(i x • • • if) = Zs-H{i x • • • *,) • F{i x • • • i f ). 

i i 

Hence we may replace (4.8) by 

e ( s ) = Zse a (k x • • • kf) • E x {k x • • • k f ) 

a,i 

and the coefficients of F f are then given by 

F '(h •••*/) = 2X(h • • • if ; k x • • • kf)E„(k x • • ■ k f ) 

<X,k 

where 

c «(*i K - • • k,) = ZsF(i t • • • if) • • * • kf). 

8 

Because of the summation over all elements s of the group tt 
this transformation with coefficients c a is symmetric ; hence 
the assumption that the sub-space is invariant allows us to 
conclude that F ' lies in if the E a do. But this establishes 
our theorem. 

The theorem can also be proved directly, without calling 
on the theorems of § 3, in the following way. That F is in 
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means that F(i 1 i 2 • • • i f ) is in p and is consequently of the 
form (2.6) : 

P(i i • • • if) = SbJk * • * if) h • • • k f ) • EJk, • ■ • k f ). 

a.,k 

The E x constitute a basis of Writing down the ^-component 
of this equation and replacing the indices • • •, i r by i x • • •, i f 
we find the equation 

F{i\ * * * if) == if > ^/) * E a [h i * * • &/) 

oc.,k 

for the components of F. Since this holds for every permutation 
s"" 1 we may sum over the elements of the group and obtain 

F(ii • • * if) = 2Jc a (ii if J &/) * E<x{ki * * * fe/), 

a, ifc 

where the coefficients 

*.(*i •••*/) •••*/; h - • • kf) 

are symmetric. Hence since the F* belong to the invariant 
sub-space $ and F is obtained from them by a symmetric 
transformation, F also belongs to 

The only part of theorem (4.5) which is not self-evident is 
the assertion that p 1} p 2 are independent. By theorem (4.7) we 
have the relations 

Ir -s Ipi -3 to* -3 

for the (invariant) intersection p* of hi and ^ 2 * But since 
^i) $2 are independent it follows that $p*, and therefore £*, 
is empty. 

Theorem (4.5) shows the $ associated with an irreducible J) 
is also irreducible , Hence it follows, in particular, that the 
manifold of symmetric and the manifold of anti- symmetric tensors 
are irreducible and invariant , not only with respect to the algebra 
of symmetric transformations, but also with respect to the 
transformations induced in tensor space by the affine or unitary 
groups of transformations in the vector space 9t. Applying 
this to the 2-dimensional vector space, we see that the repre- 
sentations E, of c = c 2 or u constructed in III, § 5, are irreducible. 

In order to prove (4.6) we must first examine the nature of 
T 0 (for n<f) in some detail. We call the component a(l) of 
an element a of the algebra the trace of a. Hence the trace 
of the product aft, which we call the scalar product tr (aft) 
of a and ft, is 

tr(aft) = £a(s)b(s~ x ). 

V 
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The trace of a is then tr(al) = tr(Ia) = tr(a). The scalar 
product is obviously symmetric in a and b, and the symmetric 
bilinear form tr (ab) is non-degenerate, i.e. a = 0 is the only 
element for which the equation tr(ax) = 0 is satisfied identically 
in x . 

Auxiliary theorem (4.9). t 0 is a left - as well as right-invariant 
sub-algebra of X. tr (ab) is non- degenerate within t 0 , i.e. the only 
element a of T 0 whose scalar product with every element x of x 0 
vanishes is a — 0. 

The first part of this theorem is almost self-evident. For 
if x = F(i \ • • • if), the element x' defined by #'($) = x(sr) is 
F\h * * 4 if) where F' = rF. 

Let i be the generating unit of t 0 , a an element of t 0 and 
x an arbitrary element. Then since t 0 is right-invariant ax 
is also in r 0 , whence 

ax — ax - /, tr (ax) = tr(a • xi). 

Now xi is in r 0 ; hence if the scalar product of a with every 
element xi of r 0 vanishes then tr(ax) = 0 without restriction on 
x . It therefore follows that a = 0, as asserted. 

Proof of theorem (4.6). Let E a be a basis for and let the 
similarity correspondence of $ on send E a into the basis E' a 
for.$($'. Let cjfx • * • if) be a given system of coefficients and 
write 

c = Zc a {ii • • • if) • £„(*! • • • if) (4.10) 

a, i 

c' — £c a [i i • • • if) • E'Ji-L • • • if) . 

*, i 

The desired similarity correspondence between p and p f is naturally 
to be defined by c c'. However, this is only possible provided 
two systems of coefficients cjfx • • • if) which define the same 
c also define the same c ' ; or a system of coefficients which 
causes c to vanish must also cause o’ to vanish. 

We first remark that if a tensor F satisfies the equation 

G - Zc(s-') ■ sF = 0 

a 

then also 

G' = Ec\s~ l ) ■ sF =-- 0. 

By (4.10) 

c(s 1 ) = Zs cjk 1 kf) • E x {k x • ■ • kf), 

whence 

G(h ' • • if) = • * * if ; kx • • • kf)EJJtx • • • kj) 
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where 

c Jfi • • • if 5 k x * * kf) = JF$F(i 1 • * • i/) • sc a (^ 1 • • • kf ). 

* 

These c a define a symmetric transformation. Hence the given 
similarity transformation 5(5 5(5', which sends E a into E a \ sends 

G into G This proves our assertion that the vanishing of 
G implies the vanishing of G'. 

If c = 0 we then have 

2y(s _1 ) • S-F(h • • • if) = tr[c' • Ffo • • • i y )] = 0 

f 

for all tensors E and all combinations of indices i x • • • or 
tr(c'x) = 0 for all elements x of t 0 . Hence by the auxiliary 
theorem (4.9) c' = 0. 

The result of our investigations is that there exists a one-to-one 
correspondence between the mvariant sub-spaces (3 of t 0 an & the 
invariant sub-spaces 5(5 of SPA This correspondence is as close 
as possible ; irreducibility , complete reduction ) equivalence and 
inequivalence on the one hand imply the same on the other. In 
particular, we emphasize the further consequence : 

Theorem (4.11). Every mvariant sub- space 5(5 of ffi, in 
particular ffi itself can be completely reduced into irreducible 
invariant sub-spaces. 

I hope that our elementary methods have made this corre- 
spondence quite apparent. 

It is evident a priori that we can completely reduce the 
modulus 1 of the algebra p into a sum e 1 + e 2 + • • • + of in- 
dependent primitive idempotent elements. The formula 

F = e x F + e*F + • • • + e m F 

then gives the complete reduction of ffi into independent in- 
variant sub-spaces 5(5 X , 5(5 2 , • • % 5JS*,, each of which is generated 
by one of the idempotent operators e. (5(5 X consists of all tensors 
of the form e x F.) From this point of view we might consider 
as the only non-trivial result of our investigation the assertion 
that the 5(5 generated by a primitive e is irreducible (with respect 
to the algebra S of all symmetric transformations). Physically 
this means that the class of terms .corresponding to such a 5(5 
cannot be further divided into parts which cannot under any 
conditions interact with each other. If in spite of this there 
does exist such a decomposition it is accidental — i.e. attributable 
to the special dynamical situation in the case in question. 
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§ 5. Fields and Algebras 

We here interrupt our development in order to present an 
axiomatic treatment of the two fundamental concepts field and 
algebra ; our investigation has revealed the importance of these 
concepts for quantum theory. The physicist who is not par- 
ticularly interested in such a treatment may well omit these 
sections. 

A field is a domain of elements, called numbers, within 
which the two operations of addition and multiplication are 
defined and which associate with any two numbers a, /? of the 
field certain unique numbers a + P, a/? respectively. Addition 
obeys the commutative and associative laws 

« + £ = £+«. (« + 0 +y = «+(0+y) 

and has a unique inverse, subtraction. From this follows the 
existence of a unique number o (zero) with the property 
a + o = o + a = a for all a. Further, associated with each 
number a is a number — a, its negative, such that a + ( — a) = o. 

We require that multiplication obey the associative law 

(« P)y = 

and the distributive laws 


(a + fi)y — (ay) -f (fiy), a (fi + y) = ( a j8) -f- (ay) 

with respect to addition. From the distributive law follow 
the relations 

ao = oa = o. 

Multiplication need not be commutative j in case it is we speak 
of a commutative field • Further, division by any number 
other than o shall be possible and shall lead to a unique quotient 
i.e. each of the equations A 


= 7)<X. = f} 

have for given a =# o and given p one and only one solution 
$,7) respectively. From this it follows that the product ocfi of 
two numbers can only be o if one of the two factors is o. As a 
further consequence, there exists a number s, “ one ” or “ uni tv ” 
with the property that 

ocs = eoc = a 


for all a. We explicitly assume that not all numbers equal o • 
then m particular s + o. Every number a * o possesses a 
unique reciprocal a -1 with the property aa -1 = a -1 a = e 
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We must introduce in addition to the numbers of our field 
the ordinary numerical symbols 1, 2, 3, * * \ Their inter- 
pretation as multipliers is given by the equations 


la = a, 2a = a + a, 3a — (2a) + # * *, 


in general 


(n + l)a = (ft a) + a. 


In particular we can construct the series 

le, 2e, • • •, ns, • • • (5.1) 

of multiples of s. We then have two possibilities. (1) All the 
numbers of this set may differ from s ; then they are all different, 
and we can conclude with the aid of the equation 


np = ne - ft 


and the division axiom that for a given number a there exists 

■P QC 

one and only one number p = - which satisfies the equation 

n/3 = a ; we can then introduce ordinary rational numbers as 
multipliers. (2) The second possibility is that one of the multiples 
in (5.1) is equal to s itself ; let the least multiple of this kind be 
pe. Then the numbers of the series (5.1) repeat in cycles of 
length p. p must be a prime number, for if p were the product 
of two integers m, n smaller than p we would then have 


o = pe = me * ne, 


but by assumption neither me nor ne are o, for pe is the lowest 
multiple of this kind, and this is contrary to the division axiom. 
In this case we are dealing with a finite field of modulus p. A 

In order not to lose ourselves in too broad generalities we 
now take as our number domain a commutative field and define 
a linear associative algebra of finite order over this field. 
By number we mean the elements of the field, and denote its zero o 
and its unit e by 0 and 1 ; by element we mean an element of the 
algebra. We denote the former by small Greek and the latter by 
small Latin letters. An algebra is characterized by three fundamen- 
tal operations : addition of two elements, a + b ; multiplication of 
an element by a number , ya ; multiplication of two elements, ab . 
The first and second of these operations obey the familiar axioms 
of vector calculus (I, § 1), which we set forth here again for the 
sake of completeness. 

Addition is commutative and associative and has a unique 
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inverse, subtraction. It then follows that there exists a null- 
eletncnt o. Multiplication by a number obeys the laws 

la a, a 03c) -= (a fi)c, 

(a \~ fi)c (a c) + (fie), oc [b + c) = (oc$ + (ac). 

The order h is introduced by the dimensionality axiom : every , 
h | 1 elements of the algebra are linearly dependent, the co- 
eltieients in the equations expressing the dependence being 
numbers of the field, but there exist h linearly independent 
elements. A set of h such elements e u e it • • *, e h) called “ basal 
units," form a basis for the algebra in the sense that any element 
a can be expressed in one and only one way in the form 

a a. it' i f- a 2 f a + • • • + 

and can be replaced by the set (a 1( a 2 , • • •, a A ) of h numerical 
components. 

Multiplication of elements among themselves obeys the 
distributive laws 

pi | b)c (ac) •}■ (Mi c ( a + b) = ( ca ) + (cb) 
tor both factors and the associative laws 

ya - b ■ - y(ab), b- ya — y(ba), 

( ab)c --- a(bc) 

We neither assume that multiplication is commutative nor 
that it possesses a unique inverse, division. But we do assume 
that the algebra possesses a “one,” the modulus (ox principal 
unit , i.e. an element e with the property ae — ea — a for all 
elements a. We shall usually not hesitate to denote the zero 
and one of the elements of the algebra by 0 and 1. 

If we assume the possibility of division the algebra reduces 
to a fin general non-animutative) field or division algebra of 
liuitr order h over the given field. 

§ 6. Representations of Algebras 

|-i 1 1 the sake of the printer and in order to give the text a 
more peandul appearance we no longer emphasize the elements 
of our algebra by expressing them in boldface type. Thts 
applies in particular to the elements of the algebra p of sym- 
metry quantities " which we may often denote by this latter 
expression in case of possible confusion with the elements of 
Utr im»lrtlyi <45 fjniup. We still employ this means of distmguisa- 
me between the tensor /*' and the symmetry clement F or when 
w« wedi to consider an element as an operator acting on a tensor. 



REPRESENTATIONS OF ALGEBRAS 305 

We start with an algebra p of finite order h, the elements of 
which constitute an ^-dimensional vector space r, and associate 
with the element a of p the correspondence 

(a) : x x' == ax 

of t on itself. We consider the algebra (p) of transformations 
(a), which is simply isomorphic with the algebra p, as funda- 
mental for the vector space t, i.e. the term reducible, invariance, 
etc., as applied to sub-spaces of t are with respect to the 
group of transformations ( a ). We assume that x can be com- 
pletely reduced into irreducible sub-spaces pi + “h * * * ; each of 
these sub-spaces then contains an idempotent generating unit 
<?i, e 2 , • * * . We have already seen that this assumption is true 
for the algebra associated with any finite group — at least under 
the restriction that the field over which the algebra is defined 
does not have as modulus a prime number which is a factor of 
the order h of the group. 

We discussed the representations of a group or of the corre- 
sponding algebra in Chapter III. We found that the irreducible 
representations are subject to certain important conditions 
which, surprisingly enough, limit their number and which, 
together with the as yet unproved “ completeness theorem,” 
lead to the reduction of the given algebra into independent 
simple matric algebras (III, § 13). That we were unable to 
prove the completeness theorem with the methods there em- 
ployed was to be expected, for we assumed that the representa- 
tions were given and examined their properties ; we had no 
general process for the construction of representations of the 
given algebra. But we are now in possession of the materials 
for such a construction : the reduction of t into irreducible 
sub-spaces p 4 reduces the regular representation into as many 
inequivalent irreducible representations of our algebra as there 
are inequivalent invariant sub-spaces We shall now carry 
out this construction process to the point of obtaining the re- 
duction of our algebra into independent simple matric algebras ; 
it will be desirable to derive the previous results again from this 
standpoint. A further difference between this investigation 
and that of Chapter III consists in the fact that we here refrain 
as long as possible from placing restrictive assumptions on the 
commutative field over which the algebra is defined ; only at 
the end of the investigation do we discuss the advantages at- 
tributable to the fact that the continuum of complex numbers, 
the only field in which we are interested for the physical appli- 
cations, is algebraically closed . 
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Theorem (6.1). Every representation of the algebra p is com- 
pletely reducible into irreducible representations . Each of these 
irreducible constituents is equivalent to the representation induced 
in some p t by the regular representation . 

(Hence the complete reducibility of the given algebra implies 
the complete reducibility of its representations. Further, every 
irreducible representation is contained in the regular repre- 
sentation, which therefore constitutes an appropriate starting 
point for obtaining all representations by the method of reduction). 

Let $ be an w-dimensional representation, and let e lf e 2 , • • •, 
e n be n fundamental vectors constituting a co-ordinate system 
in the representation space 3 ft of §. If the element a of the 
algebra corresponds to the linear correspondence A in Jp, we 
interpret the equation 

j' = ai as £' = A%, 

where j', j are vectors in 9t. If e is a given fixed vector and x 
runs through all elements of one of the irreducible invariant 
sub-spaces p = p x of r then, as we shall show immediately, 
xt runs through a certain sub-space p(t) of 9t which is invariant 
with respect to £>. Indeed, the transformation A associated 
with an arbitrary element a sends xt over into ( ax)t , and if 
x is in p f ax is also. p(e) is either 0 or is similar to p in the sense 
that different x generate different images xt } for those x of p 
for which xt = 0 constitute an invariant sub-space p' of p , and 
in virtue of the assumption that p was irreducible p' must 
either be 0 or p itself. Hence if p(t) =|= 0 the representation 
induced in p(e) by § is equivalent to the regular representation 
in p. 

These considerations are to be supplemented by the following 
remark. If $ is any invariant sub-space of 91 then p(t) is either 
independent of or is contained entirely in for those elements 
x of p for which xt lies in $ constitute an invariant sub-space 
of p , which is therefore necessarily either 0 or p itself. 

Now construct successively 

Plfcl); ?2( e i)> * * ’> 

^Pl(^s) ) p2^2)i * * 

^2^n) > ■ * ’> 

Each sub-space in this list is either entirely contained in the 
sum of the previous ones or is independent of this sum ; on 
retaining only those sub-spaces for which this latter possibility 
is realized we obtain a reduction of 9i into certain invariant 
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sub-spaces p<(c fc ). To prove this theorem we need only to note 
that the sum of the sub-spaces contained in the first row con- 
tains at least the vector e b that on adding to them the sum of 
those contained in the second row we obtain at least the vector 
e 2 in addition, etc. 5 

The theorem just proved is in particular applicable to the 
symmetric group 7 t, and we now wish to establish the analogue 
for the algebra E of symmetric transformations in the space ffi 
of tensors of order /. We already know that ffi can be reduced 
into sub-spaces which are irreducible with respect to E 
(provided the number field over which E is defined does not have 
as modulus a prime 5^/). Every transformation A of E is at 
the same time a transformation A { of ^ on itself and the corre- 
spondence A -> Ai is naturally a representation of E, the 
“ representation induced in ^ by the algebra E We wish to 
show that the representations of E are completely reducible 
into irreducible constituents, and that each of these constituents 
is equivalent to the representation induced in some by the 
algebra E. Naturally this does not follow immediately from 
theorem (6.1) ; in order to establish the connection between 
the two we must show that the complete reducibility of ffi into 
irreducible invariant sub-spaces implies the same for the 
algebra E. We apply the notation and conventions given at 
the beginning of this section to the algebra E: (A) is the 
correspondence 

S-+S' = AS 

of the “ vector space ” E on itself, A (A) the regular repre- 
sentation of E ; the algebra of transformations (A), which is 
simply isomorphic with E } is taken as fundamental in the vector 
space E } i.e. the transformation group of E consists of the 
transformations (A). 

Theorem (6.2). Let E he an algebra of transformations in a 
vector space 9t, and let St he completely reducible with respect to 
this system E of transformations into irreducible invariant sub- 
spaces Then E is itself completely reducible into irreducible 
invariant sub-spaces JJ h and the representation induced by the 
regular representation in TIj coincides with (more precisely , is 
equivalent to) the representation induced in one of the irreducible 

by the algebra E itself. 

This theorem holds without any restrictions on the field 
over which E is defined. Let 77 be an irreducible invariant 
sub-space of E (consisting not merely of the transformation 0), 
and let R =(= 0 be a transformation of TI. There then exists 
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a vector a in SR such that Ra =j= 0. Let a be decomposed into 
its components a< in the various sub-spaces 5|$< ; at least one of 
these components, say a< = e, must be carried over into a vector 
Re + 0 by R. We now hold e fixed and let 5 in 8 = 5e run 
through all transformations of 77; these § then constitute an 
invariant sub-space 17(e) of 5(5 = $<. The “ typical reasoning " 
already applied in the proof of the previous theorem then allows 
us to conclude that : 

(1) 77(e) is either 0 or 5p, as $ is irreducible; in this case 
it is necessarily 5(5, for the vector Re 4= 0 belongs to 77(e). 

(2) 5 = 0 is the only transformation in 77 which sends e 
over into 0, for those 5 of 77 for which Se — 0 constitute an 
invariant sub-space of the irreducible sub-space 77. Hence 
8 = Se sets up a one-to-one correspondence between 77 and 5(5. 

This correspondence is similar, for S' — AS implies that 
the vectors 8 — Se, 8' = S'e satisfy the equation 8' = A8. We 
have thus proved the second part of our theorem : the repre- 
sentation induced in 17 by the regular representation coincides 
with the representation induced in 5(5 by the algebra itself ; 
briefly, 17 is similar to some 5(5^. 

Since Se runs through the entire sub -space 5(5 when 5 runs 
through 77 there exists an E in 77 such that Ee — e ; then 
E*e = e. Since the transformations E and JS 1 of 77 both 
associate the same image with e they are identical : E is idem- 
potent. Hence 27 can be completely reduced into two inde- 
pendent sub-spaces 77 + 27' in accordance with the formula 

S = SE + (S — SE). 

[Cf. the proof of Theorem (3.3).] Successive application of 
this procedure leads to the complete reduction of 27 into its 
constituents 17). 

Having proved Theorem (6.2), we obtain from Theorem 
(6.1), under the same assumptions, the further theorem : 

Theorem. (6.3). Every representation of 27 is completely 
reducible into irreducible representations. Every irreducible re- 
presentation of 27 coincides with the representation A-+ A t 
induced in some 5($< by the algebra 27 itself. 

Theorem (6.1) yields the further (rather uninteresting) fact 
that not only is every 77) similar to some 5(5^, but also conversely 
every 5(5< is similar to some 77). 

As has already been indicated, all of these results are applic- 
able to the algebra of symmetric transformations in tensor space 
5R . But we have shown in § 1 that this algebra can be replaced 
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by the group (c :)/ induced in tensor space by the group c of 
linear transformations 

n 

x 'i — 2>{ik)x k [det [a(ik)\ =f= 0] (1.3) 

of n-dimensional vector space, i.e. by the representation (c)/ of 
C. We shall say that a representation of c is of order / if the 
components of the matrix A, which corresponds to the element 
(1.3) of the group, are rational integral functions of the a(ik) 
of order /. Our theorem then asserts : 

Theorem (6.4). Every f th order representation of c is com- 
pletely reducible into irreducible representations , and every irreduc- 
ible representation of order f oft is contained in the representation (c)L 
This theorem is still valid on restricting the affine group c to 
its unitary sub-group u. (Naturally the concept “ unitary ” im- 
plies that we are then no longer dealing with an arbitrary field, 
but are operating in the field of all complex numbers.) 

§ 7. Constructive Reduction of an Algebra into Simple 

Matric Algebras 

We again assume that the algebra p of order h, which may 
at the same time be considered as a vector space x of h dimensions, 
is completely reducible into irreducible invariant sub-spaces p-. 
The generating units e t of these irreducible p t - are obtained by 
the corresponding reduction of the modulus ; we can then 
express an arbitrary element x of r as the sum 1 of its components 
in the various : 

1 = 2X- («< in !><), * = 2Jxe ( . (7.1) 

i i 

Ifq is a sub-space of x we denote by qa the totality of elements 
of the form xa where # runs through all elements of q ; e } with 
or without index, is an idempotent element, usually primitive ; 
p = Xe the invariant sub-space generated by e ; | the repre- 
sentation of p induced in p by the regular representation. 

We could consider in addition to the reduction (7.1) of t 
into left-invariant sub-spaces the analogous reduction into 
right-invariant sub-spaces by means of the equation 

# = Ze { x . 

i 

But the most complete separation into mutually independent 
components is obtained by carrying out both of these processes 
simultaneously : 


* = Zetxejc = Zx ik . 


(7.2) 
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The elements of the form e { xe k are those of character (e it e *), 
or briefly (ik). Let p ik be the sub-space consisting of all elements 
of this character. The various p ik are independent and the 
entire r is reduced into the sum of the p ik ) the original left- 
invariant p k = Ep ik , The important properties of p ik are given 

i 

by the following : 

Auxiliary Theorem (7.3). I. If p' are two inequivalent 
irreducible sub-spaces with generating units e , e' t all elements of 
character (e, e') are = 0. 

II. The elements of character [e } e) constitute a field or division 
algebra which is simply isomorphic with the system of similar 
projections of p on itself 

Proof I. Let a be any element of character [e ) e'). The 
transformation 

[a] : x-> x f = xa (7.4) 

carries every element x of p over into an element x ' of p ' and 
defines a similar projection. Conversely, we know that any 
similar projection of p on p' is defined by an equation of this 
form, and that the generating element a of character ( e , e') is 
uniquely determined by the projection. If p and p' arc irre- 
ducible our “ typical reasoning ” leads us to the two usual 
alternatives : either the projection associates with every clement 
x of p the image x r = 0 or it defines a one-to-one correspondence 
of p on p'. The equation ea~ a tells us that the first alternative 
is possible only if a = 0, and the second implies that p and p' 
are equivalent. 

II. The above remarks arc applicable to an clement a of 
character (e, e) and the similarity projection of p on itself which 
it generates. If p is irreducible every such projection, except 
the one defined by a = 0, is one-to-one and consequently has 
an inverse. But the existence of an inverse is identical with 
the possibility of division . The isomorphism asserted in the 
theorem is apparent on reversing our usual procedure, and 
reading the resultant of two or more correspondences from 
left to right, for the resultant of the correspondences 

%’ = xa, %" s= x'a r 

is given by 

x n = x(aa') t 

We now proceed with the help of this auxiliary theorem as 
ollows : Arrange the p t into classes of equivalent sub-spaces 
with generating units 


“ * h e t J *i, * 
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and add together the generating units in each of these classes : 


e[ + 


■ / f ft , 

+ e r = s , e x + 


+ ft 

e s 


We then have 

1 = e' + 8 " + • • • (7.5) 

t = %' + t" + • • • (7.6) 

where t', t", • • • denote the inequivalent sub-spaces ts', re", * * • 
into which t is reduced. 

Part I of the auxiliary theorem above then tells us that, 
for example, 

e'*e" = 0. 


Hence the product a'a" of two elements belonging to different 
sub-spaces t', t" is always 0, and the reduction 

a = o! ~|“ cl • • • = ciB- -j“ as "f- * * • 
leads to the multiplication rule 

ftb = a!V + a fr b " + • • •. 

From this it follows that t' is both right- and left-invariant and 
a fortiori constitutes an algebra p (“invariant sub-algebra”); 
s' is the modulus of p . The given algebra is then the direct sum 
of the simple algebras p , p", * • *, where the precise meaning of 
direct sum is defined by the following : 

Let p', p", * * • be algebras (defined over the same field), and 
consider as the elements of a new algebra p, the direct sum of 
p', p", • • •, all sets 

a = (a\ a", ■ ■ ■) 

consisting of an arbitrary element a' of p', an arbitrary a" 
of p", * • \ The fundamental operations in p are defined by 

(a\ a", - • •) + (*', ■ -) = (a' + b', «" + &",• • •), 

«",• • •) = (Aa'; A*", • • •), 

(a', a", • • W, ■ • ■) = («'*', *"b", • • 0 
where A is any number. 

Note that the central of the algebra p obtained by direct 
summation is the direct sum of the centrals of the individual 
algebras p', p", • * \ 

We investigate in detail one of these simple sub-algebras, 
say p', which we now denote simply by p ; its modulus s' 
may now be denoted by 1. On omitting the primes, the de- 
composition of I into equivalent primitive idempotcnt elements 
e { is expressed by 


1 = e x + e 2 + • • * + e r . 
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Every element a of p is reduced in accordance with the formula 
(double Peirce reduction) 

* = 2X* = £(*&*) 

into components of characters (ik). The component c i1e of the 
product c = ab is easily seen to be expressed in terms of the 
components a ikl b ik of a and b by the equation 

r 

c ik = E a U^ik* 
i-1 

We have thus already obtained the connection between our con- 
siderations and the matrix calculus. 

The invariant sub-spaces p l} p 2t " ’ *» Pr generated by the 
e lt e 2 , • * •, e r are all equivalent. Let p be any of these classes, 
e.g. p = p 1} and let r { be any fixed one-to-one similarity corre- 
spondence of pi on p . In accordance with (7.4) any element 

a s a ik = eiaejc 

of character (e i} e k ) generates a similarity projection \a\ of pi 
on ; this projection can be written in the form 

M = /wv* (7.7) 

where a is a similarity projection of £ on itself. But by Part II 
of the auxiliary theorem proved above the similarity projections 
of p on itself constitute a field (division algebra) 0 which is simply 
isomorphic with the set of elements of character (, e , e). If 0 is 
of order v each of the r left-invariant sub-spaces 

Pk = EPik 

c-i 

is of dimensionality g = r • v. The number of times r an irre- 
ducible representation occurs in the regular representation is 
accordingly a factor of the dimensionality g of the representation . 

Any element a can be reduced into its components a ikf 
which may be any elements of the independent sub-spaces 
In accordance with (7.7) 

I \ a ik[ = r i&ihTT 1 (7.8) 

and a i1c may be replaced by the corresponding element <t ik of 
the field 0 . Since conversely any such element ot ik is by (7.8) 
associated with a similarity projection [a,-*] of p t on and there- 
fore with a definite element a ik of character (ik) } we obtain d, 
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one-to-one reciprocal correspondence between the totality of 
all elements a of the Simple algebra p and the totality of matrices 

a n a i2 • * • a lf 
a 21 a 22 # • a 2r 

a u a r2 * • • a rr 

of order r whose components ot ik are elements of the field 0. 
The correspondence is such that to the three fundamental 
operations of the one (addition of elements, multiplication of 
an element by a number and multiplication of two elements) 
correspond to the same operations of the other. Note that in 
particular 

M*] = MM - r> uiy 1 - r&jrt 1 

= r i • * A" 1 - 

We have thus proved : 

Wedderburn' s Theorem . 6 < 9 / /fee simple algebras , whose 

direct sum constitutes the given algebra p ) is simply isomorphic 
with a simple metric algebra in a certain field ( division algebra) 
0 defined over the field of the original algebra . 

( Remark . The invariant sub-space consists of all elements 
a such that the matrix ||a ifc || has as its only non- vanishing column 
the The element e { is then described by that diagonal 

matrix all of whose components vanish except the one occupy- 
ing the t th place, which is 1.) 

It is readily seen that the central of the simple algebra p 
consists of those elements whose matrix (7.9) is of the form 

oc 0 • • • 0 
0 a • * • 0 

0 0 • * - a 

where a belongs to the central of the field 0. 

Our construction was divided into two steps. First r was 
completely reduced into the sub-spaces r', t", • • • which are 
both right- and left-invariant and then these were further 
reduced into the left-invariant sub-spaces hi* We must now 
return to the consideration of the first step. On multiplying 
%s' on the left by (7.5) we find 

x£ f = e'%s', 

and on multiplying e'x on the right by the same factor 

$'x = e'#s\ 


(7.9) 
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Hence 

xs' = s'x ; 

the e', e", • • • commute with all elements and belong to the central 
of the algebra. The sub-spaces t' = p , x", • • • are both right- 
and left-invariant in the sense that neither the transformation 
x’ = xa nor x’ = ax leads out of them, and they are furthermore 
irreducible in this respect — indeed, it is for this reason we call 
them “ simple.” In order to show this we proceed as follows : 

(7.10) . If T 0 is a sub-space which is both right- and left- 
invariant then either e t is contained in r 0 or T 0 £ t = 0. For 
1 0 ei is an invariant sub-space of the irreducible p { and is there- 
fore either 0 or p t itself. In the second case we have 

Pi = -3 t 0 

since t 0 is right- invariant ; hence £,• is contained in x^ 

(7.11) . If e { is in r 0 the same is true of any e which is equi- 
valent to e { . For the similarity projection x r = xb of p { on p 
associates e with some element a { of p t by means of the equation 
e = a t -J, and since a { is in t 0 e is also. 

(7.12) . If t 0 -3 r' then since t 0 = 27x 0 ^' not all the can 

i 

be empty, i.e. one of the e\ must occur in r 0 . But they must 

then all occur in r 0 , hence also s' = Ze v and consequently t 0 = t'. 

< 

(7.13) . Again let r 0 be a right- and left-invariant sub-space. 
Then either x 0 e' = x' or it is empty; in the former case s' is 
in r 0 . It follows from 

r 0 = t 0 s' -f- x 0 e" + • * * 

that x 0 is necessarily the sum of certain of the spaces t', x", • ■ • ; 
when in particular x 0 is irreducible in the sense of right- and 
left-invariance it must coincide with one of the x', t", ■ • •. 
Hence the reduction (7.6) is unique. This further shows that 
every right- and left-invariant sub-space x 0 possesses a generating 
unit i which belongs to the central of the algebra, and that t 
can be completely reduced into r 0 and a supplementary right- 
and left-invariant sub-space. 

(7.14) . If p is an irreducible (left-) invariant sub-space with 
the generating unit e ) then pe' is invariant, and since pe' = e'p 
it is either 0 or p itself. Since 

P = P* + + ■ ■ * 

the equation pe = p must hold for some one of the s', e", • • •, 
while for all others pe = 0. We then say that e belongs to p 
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and that conversely e or p belongs to e. p is a sub-space of the 
right- and left-invariant t e. 

An algebra p = t, concerning which we only assume that it 
is completely reducible into irreducible invariant sub-spaces p it 
is necessarily obtainable by successive application of the follow- 
ing processes : 

(A) Construction of a field ; 

(B) Transition to matrices : we take as elements the matrices 
of a fixed order r whose components are arbitrary elements of 
the field ; 

(C) Direct summation. 

The processes (B) and (C) are formally completely determined 
and are therefore of an elementary character. Hence the 
construction of algebras is reduced to the construction of fields, 
i.e. of special algebras in which division is possible (“ division 
algebras ”). 

The converse is naturally also true : any algebra constructed 
by the three steps (A), (B) and (C) is completely reducible, for: 

(A) If the algebra r is itself a field, r is itself an irreducible 
sub-space of r. For if a is any non-null element of the field 
then fa runs through the entire field with f ; this is merely 
the content of the division axiom. 

(B) The matrices (7.9) in which all components of every 
column except the z th vanish constitute the irreducible sub- 
space p i} and the space r of all matrices is the sum of these p { . 
pi is irreducible ; to show this we must prove that if a is any 
element in p { then any element of p { can be expressed in the 
form xa. a as well as a! = xa has as its only non-vanishing 
column the z th ; dropping the last index i, we denote these two 
columns by 

(<*i, • • •, Or), K. 4. • • •> 4), 

respectively. The equation a' = xa is then 

4 = Z€ik*k ; 

1 

we are therefore concerned with proving the theorem that any 
non-vanishing ‘‘vector” (a x a 2 • * * a r ) can be transformed into 
any given “vector” • a') by an appropriate linear 

correspondence. Since not all the a fc vanish take one of them, 
say a 2 , which does not vanish and let all f £fc for which k 4= 2 
be 0 ; f £2 is then to be determined by the equation 

4 = ; 

that this is possible is guaranteed by the division axiom. 
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(C) The assertion is self-evident for this step. 

In general only the first step, (A), does not lend itself to an 
exhaustive formal treatment. However , if the field over which 
the field (“ division algebra ”) referred to in (A) is defined is 
algebraically closed this step becomes extremely simple : 

The only division algebra of finite order over an algebraically 
closed field is this field itself 

Proof . Consider an algebra of order v defined over an 
algebraically closed field. If a is an element of the algebra 
there must exist a linear dependence between the v -f- 1 powers 
a v , a v ~ x , • • a, 1, i.e. a linear relation whose coefficients are 
numbers of the field. Hence a satisfies an algebraic equation 
of degree m v : 

/(A) = A* 1 + y^- 1 + • • ■ + y m 
f{a) = a m + y l a m ~ 1 + • • • + y m l = 0. 

Since the field is algebraically closed /(A) can be expressed as 
the product of linear factors : 

/(A) = (A- ai )(A-a s ) • • • (A — a m ). 
Correspondingly 

(a — otjl) (a — a 2 l) • • ■ (a — a m l) = 0. (7-15) 

We now introduce the assumption that the algebra of order v is 
a division algebra ; then the product of two or more elements 
can vanish only if one of the factors is 0. Hence we may con- 
clude from (7.15) that a = ocjl for some i ; the algebra then 
consists of the products of the modulus 1 with any number of 
the fundamental field, and therefore the algebra itself is simply 
isomorphic with this field. 

If we are dealing in the field of all complex numbers the 
auxiliary theorem (7.3) can be replaced, in accordance with 
the above, by the more definite : 

(7.3'). All elements of the form ex's are zero if the primitive 
idempotent elements e, e' are inequivalent. If they are equivalent 
all such elements are multiples of one of them ( which is different 
from 0). 

Further : The number of times an irreducible representation 
appears in the regular representation is not merely a factor of the 
dimensionality of the representation ; it is actually equal to it. 
Our analysis has thus revealed the true source of this remarkable 
fact. 

Under these circumstances the given (“ semi-simple ”) algebra 
is the direct sum of simple jnatric algebras over the original field. 
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We obtain a complete set of basal units e' ik , e lK) * * * *• 

* = 2:44 + £44 • • •, (7-i6) 

ik ik 

for the algebra ; these basal units satisfy the multiplication 
law of “ matrix units/’ i.e. products of the type 

44 - 4 (i- 17 ) 

and all others vanish. The correspondences 

a -*■ II4II. a -> II4II, • • • 

are the inequivalent irreducible representations t)', 1)", * ■ •. 
The basal units e' {i , e[ 0 * • * are the generating units e i} e[ y • * • 
of the irreducible sub-spaces with which we began our con- 
struction. e ik is the element of character (ik) generated by 
the correspondence of p* on p k , i.e. that element which 

this correspondence associates with e\. 

After having obtained the irreducible representations in 
this constructive way we derive their orthogonality properties 
again from our present standpoint. For the moment let the 
trace of a denote the trace of the correspondence 

x — > y — ax (7.18) 

of X on itself which is associated with a in the regular repre- 
sentation. In terms of the co-ordinate system defined by the 
basal units above this correspondence becomes 

9 ' 

4 = 2><j4> ' • •• 

j-i 

Each of the g ' columns of variables 

£*) &*, ' • *. (k — 1, 2, • • •, g') 

undergoes the transformation with matrix ||a^j| ; the trace of 
a is accordingly 

S’ ■ Z*i< + • • •• 

By (7.16) this is equivalent to the equations 



for the basal units. Hence by (7.17) 

tr (44) = S’. ‘ ‘ * 


(7.19) 
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and all other types of products of basal matric units have a 
vanishing trace. 

If the algebra is the algebra of a group of order h the corre- 
spondence (7.18) is expressed in the original co-ordinate system, 
consisting of the elements s associated with the elements s of 
the group, by the equation 

y(s) == Zaist-^xit). 

From this it follows that the trace, as defined above, of a is 
equal to h* a(l) ; but in the case of a group algebra we have 
previously called a( I) itself, without the factor h ) the trace of a . 
On returning to this original definition of the trace we need 
merely to replace the right-hand side g' of the orthogonality 
relations (7.19) by g'/h. Equation (7.16) may now be solved 
explicitly for the coefficients : 

<4 =• ~tr {ae'ti) = p • £a[s) • 4 (* -1 )- (7.20) 

6 6 * 

The connection with the development in Chapter III, § 13, is 
obtained by noting that the 

*»(*) = ‘ «k(0 (7.21) 

are the components of the matrix U'(s) associated with the 
element s of the group in the irreducible representation f)'. 
The character of f)' is therefore 

x '(,) = *. 6 '(,-i) * (7 .22) 

and (7.19) yields the orthogonality relations for the representa- 
tions. 

We have thus arrived at a constructive formulation of the 
theory, in which the fundamental concepts involved in and the 
range of validity of each step are clearly apparent. It supplies 
us with a constructive method for obtaining a complete set of 
irreducible representations, as well as establishing the ortho- 
gonality relations. 

Additional remark . In dealing with the continuum of all complex 
numbere and a group algebra defined over this field we can, in accord- 
ance with the remark at the end of § 3, completely reduce the modulus 1 
into real primitive and the space r into the corresponding unitary - 
orthogonal irreducible Further, the projections r { can be normalized 
in such a way that is conjugate to e' it . To show this we note that 
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the conjugate of e %1c is under these conditions an element of character 
(ki) and must therefore be the product of e' H by a number y ik : 

4 = r« * (7-23) 

The rules 

e a e ki = e u> hi = hi e ik 

yield the conditions 

yikVki ~ Yil> Yu — 1 

on the coefficients. Further, y ik is real and positive, for from (7.23) 
and (7.19) we find 

= tr (4 4) = f- ■ Yu- 

We then find that the y ik can be brought into the form y ik — p k /p%, where 
the ft. are positive real numbers (take, for example, p* = y 1{ ) . On re- 
placing the original correspondences r. by p .r. we find that the new e ki 
is actually conjugate to the new e[ k . Our representations l/, f • • * are 
accordingly thrown into unitary form. 


B. Extension of the Theory and Physical Applications 

§ 8. The Characters of the Symmetric Group and 
Equivalence Degeneracy in Quantum Mechanics 

The notation employed in this section is as follows : 77 r = 7Tf 
is the symmetric permutation group of / things, t = p — (tt) 
the corresponding algebra, e a (primitive) idempotent element 
of p ) p = xe the (irreducible) invariant sub-space of x generated 
by e , f) the representation induced in p by the regular repre- 
sentation, g the dimensionality of p and f), x the character of 
f), s that element of the set s', s", • • • (7.14) to which the irre- 
ducible p belongs ; ^ the corresponding symmetry class of 

tensors of order /, consisting of all tensors of the form &F, § 
the representation^ the algebra Z of symmetric transformations 
(and therefore of the linear group c) which is induced in 5$ by Z 
itself. When further differentiation is necessary, we also denote 
this $q by §(x) or § n (x). In case the considerations are valid 
for an arbitrary finite group 7 r, h denotes the order of tt (=/! 
for TT f ). 

Determination of the Group Characters . 

We begin by calculating the character of the representation f). 
To this end we construct the trace of the linear correspondence 

( 8 . 1 ) 


%—> y = ax 
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of p on itself ; the considerations of the previous section show 
it to be 

Za{s)x(s). 

8 

Now consider instead of (8.1) the projection 

x -> y = axe (8.2) 

of the total space r on p ; it coincides with (8.1) within p and 
sends any element x of t into an element y of On choosing 
the co-ordinate system in t in such a way that the first g funda- 
mental vectors span the sub-space p } the last h — g rows of the 
matrix of (8.2) consist only of zeros ; hence the trace of the 
projection (8.2) of the total group space is equal to the trace of 
the correspondence (8.1) inp. In terms of components equation 
(8.2) is 

y(s) = £a(t)x(s')e(t'), ( ts't ' = 5 ) 

and the trace is therefore 

ZZ*{tW) 

8 

where the inner sum is extended over the pairs t , t' of elements 
of the group which satisfy the equation tst' = s , or explicitly, 
the trace is 

2{a(t)Ze(s-H-'s)}. 

I 8 

Hence the character x of is given by 

x(t) = Ze(s-H-'s) 

« 

or 

x ( s ) = Zeirs-h'- 1 ). (8.3) 

In particular, the dimensionality g of the representation f) (and 
the space p) is 

x(l) = h-e(l). 

Resonance or Equivalence Degeneracy, 

The significance of our results for quantum mechanics, as 
first recognized by Wigner , is the following. 7 The complete 
reduction of the tensor space ffi into invariant sub-spaces 
implies a separation of the terms of the physical system If, 
consisting of / equivalent individuals I (electrons), into sets of 
terms which no dynamical influence whatever can cause to 
enter into combination with each other. We have further seen 
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that the reduction of Sft/ into the parallels the complete 
reduction of the total group space t of the symmetric permutation 
group 7r into invariant sub-spaces Hence there is a system of 
terms associated with every irreducible representation f) of tt — 
which' we denote simply as the term system using the 
character x of f) as a name for the system — and the multiplicity 
of this term system is the number m(x) of times that f) occurs 
in the regular representation. This suffers a slight modification 
in case n </, for we must then ignore all hi which are not con- 
tained in t 0 = \ ffl. But since t 0 is both right- and left-invariant, 
all sub-spaces which are equivalent to an irreducible invariant 
h lying i n r o are a ^ so in r o* Hence the multiplicity of the term 
system % is m(x) or 0 according as that s with which the character 
X is associated by (7.22) is in r 0 or not. From the physical 
standpoint, the only additional fact of interest obtained from 
the more extended theory built up on the assumption that the 
number field in which we are operating is algebraically closed 
is that then the multiplicity m(x) is equal to the dimensionality 
g of the representation f). Furthermore, it is impossible to 
resolve this multiplicity by any physical means whatever, for 
corresponding terms in these various term systems remain in 
coincidence under all dynamical influences. 

We consider the resolution of terms in the case in which the 
interaction between the / individuals is expressed by a small 
perturbation energy A W, neglecting higher powers of the small 
parameter A. Assume for the moment that the energy levels 
E * * * of a single individual I are non- degenerate. On 

neglecting the perturbation If possesses energy terms of the type 

E = E 1 + E % + • • • + £/; ( 8 * 4 ) 


we first concern ourselves with such a term. Its multiplicity 
is f 1 and the corresponding co-ordinates in tensor space are the 
coefficients F(i lt • • * v) whose ^ indices are any permutation 
c of 1 2 •’•,/. This coefficient F(t{i 2 ••*»/) 1S the component 
x(s) of the element 

x = F(l, 2 ,•••,/) 

of the algebra (tt). The separation of the term (8.4) is to a first 
app^matioV determined by the reduction of the correspon- 

•v; 


(*) 


F{HH ' ‘ • */) 

diagonal (ora, ; her. the matrix of the -efficients .a «pr««n«a 
the energy and t tf i 2 , ' ' ’ 1 
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5 , t of 1, 2, • • *, /. This equation may therefore be written in 
the form 

x(s) = 2>(s, t) x(t). (8.5) 

t 

The equation 

«(*V ■ • • v ; * • ■ M = «(h *»•••*/) 

describing the symmetry of a, in which 

• • •,/->/ 

s any fixed permutation r, is expressed by 
a(sr, tr ) = a(s, t) 

for the only coefficients in which we are here interested ; r is here 
considered as applied to the indices 1, 2 ,••■,/ themselves rather 
than the sub-indices. Hence a(s, t) depends only on sr 1 : 

a(s, t) = a{st~ l ), 

and equation (8.5) may now be written in the abbreviated form 

(a) : x = ax (8.6) 

where a, x, x are the symmetry elements of the algebra (ir) with 
components a(s), x(s), x(s). 

On restricting ourselves to an invariant irreducible sub-space 
$ of the system space the element x of (n) lies in the corre- 
sponding p. The g terms W lt W 2 , ■ • •, W , into which (8.4) is 
resolved by the perturbation and which belong to the term 
system x under consideration are, to the approximation involved 
in the perturbation 'theory, the characteristic numbers of the 
correspondence (8.6) of p on itself. The sum of these terms must 
therefore equal the trace of this correspondence, or 

Wi + W t + • • • + w, = Ia{s)x{s). (8.7) 

t 

The sum of the squares of these terms, of their third powers, 
etc., are obtained by reiterating the correspondence (a), i.e. 

W\ + W\+ • • • + W] = 2X(i)x(s), (8.7') 

1 

where the a r ($) are the components of the symmetry element 
a T : 

a„(j) = 1 or 0, according as s = I or =f= I, 1 

*r + i(s) = Za r (st-i)a(t). ) i 8 ’ 8 ) 

As soon as the “ exchange energies ” a(s) are known we can 
apply this formula to calculate those of the terms arising from 
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( 8 4 ) which are contained in the term system x ; for this we need 
only to know the character x — it is not necessary to have an 
explicit expression for the idempotent generator « or the 
representation fj of n. 

These considerations are immediately applicable only if we 
ignore the spin phenomena. If we take into account the per- 
turbation due to the interaction of the electrons before that 
due to the spin, as in the case of normal term order, the mere 
existence of spin implies that each of the energies E, is at least 
two-fold. . We shall later concern ourselves with the far-reaching 
modifications, caused by the spin and by the Pauli exclusion 
principle, which enables us to discard the majority of possible 
terms. 

The unperturbed If will have, in addition to terms of the 
type (8.4), terms in which groups of two or more summands 
appear with the same indices. The multiplicity of the term 

fi E i + f* E t + ’ ' * + fJ& v (fi + /* + • • • + f, = f) ( 8 . 9 ) 
with integral non- negative weights is but 



( 8 . 10 ) 


The corresponding tensor coefficients x{s) are those obtained 
from 

2 i_: ; • * •) 

fi /a 

by the permutations s of the / arguments. But a permutation 
p is without effect if it only permutes the first f x indices among 
themselves, the next / 2 among themselves, etc. ; we may no 
longer distinguish between the permutations s and ps — they 
must be considered as giving rise to but one component. Such 
permutations p constitute a group tt' = ir(f l9 f 2> • • •) of order 
h* s=a f x \f % \ * # % and two permutations s, t are to be considered 
as the same if they are left- equivalent with respect to this sub- 
group rr' f i.e. if $ sa t (ps = t ) where p is an element of 7 /). The 
only elements ^ of the algebra (tt) in which we are now interested 
are those which satisfy the equation 

%(t) = x(s) when t = s (mod. tt') ; 

they constitute a linear sub-space x' = 1 ( 7 /) of dimensionality 
(8.10). More precisely, x' is a right-invariant sub-algebra, for 
if s is t then also sr s tr. Again a(s, t) = a(st~ x ) ; further 

a{ps) = a(s), a{sp) = a(s) 


if p is in it'. 
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We are now concerned with the correspondence x x in t' : 

£(s) = Za(sr x )x{t) (mod. tt'), (8.11) 

where the “ mod. tt ” indicates that both s and t run through 
a complete set of elements of the group which are inequivalent 
mod. tt'. As x runs through t', xe generates a sub-space p' of t' 
which is transformed into itself by the correspondence (8.11), 
and the reduction of this correspondence of p' into diagonal 
form yields those terms arising from (8.9) and lying in the term 
system The trace of (8.11) in p' is equal to the trace of the 
correspondence A e : x-+x in t' which is obtained from (8.11) 
by replacing x by xe, i.e. x(t) by 

jgxfr-yefy) = Zx{r)e{r-H), 

r r 

Hence 

tr(A e ) = E Msr')Ee(r-H)}. 

s, t mod. n* r = s 

Since a(sr l ) = a{rt~ x ) when r s s (mod. n), this trace may be 
written 

I Za(rt-')e(r-H). 

tmod.71' r 

Naturally this sum does not depend on which particular element 
t we have happened to choose from the set of group elements 
which are equivalent mod. n • hence on dropping the restriction 
on the range of t the above sum is multiplied by the order h’ 
of tt': 

tr ( A e) = l>Z<rt-')e(r-H) == ^Ea{s) X {s). (8.12) 

Here again *(s) is the character of f) as determined by (8.3). 
In particular, the dimensionality of p', i.e. the number of terms 
in the system x arising from (8.9), is obtained by replacing the 
symmetry element a in (8.12) by the element a 0 defined by 

a 0 (s) = 1 or 0, according as s = I (mod. n’) or not ; 
this number is consequently 

(8.13) 

We express this result, the validity of which is not restricted 
to permutation groups, in the theorem : 

Let tt be a sub-group of tt of order h! and let p be a left-invariant 
sub-space of the group space t of it. Consider the elements x of 



k 
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the algebra (tt) which satisfy the condition #( s ) = x(t), where s and 
t are any two elements of the group it which are left- equivalent 
mod. 7 / ; the elements of (tt) which are of this type and which 
lie in p constitute a linear sub -space whose dimensionality is given 
by (8.13), where x is the character of the regular representation in J). 

The sum of the terms is equal to the trace (8.12), and the 
sums of their powers are given by 

iX (s)x{s) 

EW r^f— (8.14) 

The only way this result differs from (8.7') is by the introduction 
of the denominator / j!/ 2 ! • * * and the fact that a r (s) is now defined 
by 

a t+i( s ) = U^r(sr l )a(t) (mod. tt'). 

Degenerate Case. Denote the numerically different energy 
levels of the individual I by E\ £", • * and the multiplicity 
of by tv We now distinguish between the various variables 
having the same “ principal quantum number ” v by an “ auxil- 
iary quantum number ” k v which assumes n v values. An energy 
level of the type 

£' + £" + •• • + £(/) (8.15) 

of the unperturbed total system V has the multiplicity 
f\n l n i • • • n f , 

and the corresponding tensor coefficients are those obtained 
from those of type 

F a 2 • • • / \ 

V&j k t ■ ■ • k f J 

by any permutation i of the / pairs iy\k) of arguments ; we write 
instead 

x(s\k 1 k 2 • • • k f ) or briefly x(s\k). 

Similarly the coefficients of the energy matrix are denoted by 

a(s\k x ki • ■ • k f ] fj/ji/, • • • I,) = a^r^k ; l). 

The energy levels W arising from (8.15) by the perturbation 
and lying in the term system x are, to a first approximation, 
determined by 

£W* = IIa t (s\k ; k) x (s), 

(*) * 


( 8 . 16 ) 
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where a 0 (s\k ; Z) = 1 or 0 according as s = I, ft = l or not, and 
the composition is defined by 

a r+x (5|ft ; Z) = 2X(sT l |* ; m)a(z|w ; Z). (8.17) 

If the unperturbed energy level is of the form 

/'£'+/"£" + • • • (/'+/" + • 

the tensor coefficients in which we are interested are those ob- 
tained from 

2 2 ...... -y 

\fe u ft 12 • * • ’ k n k n • • • * V 

7 ' s r ' 

Let exactly / x of the auxiliary quantum numbers ft lv (v = 1, 

• • *, f) have a certain value ft x , a different value fe 2 , etc. ; 
/[+/« + * ’ • = /', and let / x , /£, • • • have the analogous 
meaning for the quantum numbers ft^v = !,•••, /") associated 
with the principal quantum number 2, etc. Then those per- 
mutations p which leave the above tensor coefficient unchanged 
constitute a certain sub-group 7 r*, depending on the distribution 
of auxiliary quantum numbers ft, of the group tt introduced in 
the non- degenerate case above ; the order of tt * is [ft] = /[[fy 

• • • f[\ • ■ •. a(s[ft; Z) is unchanged when 5 is multiplied on the 

left by an element of ti£ and on the right by an element of ir' lm 
The formula (8.16) now becomes ' 

Z WT = ; %(*)} (8.18) 

a 0 (s |fe ; l) = 1 or 0 according as ft = Z and s s I (mod. 7 ^) 
or not, and in the composition rule (8.17) we first sum with 
respect to t mod. and then over the various possibilities 

m = (m n , m u , • • • ; m 21t • • • ; • • •). 

In every case we obtain explicit expressions for the sums of 
the various powers of the perturbed energy levels in terms of 
the character x of the term system under consideration and the 
exchange energies a(s). 

§ 9. Relation between the Characters of the Symmetric 
Permutation -and Affine Groups 

The thorough correspondence existing between the repre- 
sentations of the symmetric permutation group ir f and the 
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representations of order / of the linear group c must lead to 
a simple relation between the corresponding characters. In 
dealing with the linear group it suffices to consider only the 
“ principal transformations ” 

x t -> Si (i = 1, 2, • • n) (9.1) 


of the vector space = 9f n , for any linear transformation is 
conjugate within c to a principal transformation — except for 
those cases in which two or more of the characteristic numbers 
Sf coincide. Furthermore, if we restrict ourselves ab initio to 
the unitary group u — the one in which we are interested in 
physics — the result is valid without exception and the s t - are 
complex numbers of unit absolute value. The problem here 
proposed is identical with that of investigating the distribution 
of the terms of V among the various term systems x i n the 
absence of interaction between the various individuals and when 
the single system I is non-degenerate, for on choosing a Heisen- 
berg co-ordinate system x { in the system space of I (i.e. one in 
which the operator representing the energy of I is in diagonal 


form) the variable x t assumes the multiplicative factor e 



in time t. 

We denote the characteristic * of the representation § of 
the linear group whose substratum consists of all tensors of the 
form eF by X(5) or X(s 1 , e 2 , * • *, e n ) where the element S of c is 
the principal transformation (9.1). The s x are to be considered 
as n independent variables. The transformation of tensor space 
associated with (9.1) consists in multiplying the coefficient 
F(i h i 2 , * * •, if) of the tensor F by s ijL • e tjt • • * e if . The sum of 
all these multipliers, extended over all linearly independent 
coefficients of a general tensor of the form F f = &F } is the desired 
characteristic. A component in which f x of the arguments i are 
equal to 1, / 2 are equal to 2, • • - is multiplied by e{ 1 - efc* * • e£ rt . 
But the number of linearly independent components of F' of 
this type is, by equation (8.13), 


_Xx(£)_. 

m ■ ■ 


(9.2) 


here x 1S the character of the representation f) of i r f) the sum 
being extended over all elements s of the group 7 r' = rr{f X} / 2 , * • •) 
which permutes the first f x numerals among themselves, the next 
/ 2 among themselves, etc. That this number (9.2) depends only 


* We prefer, fox the sake of clarity, hereafter to employ the word 
" characteristic " for continuous and " character " for finite groups. 
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on the character x is a fact of greatest importance for our present 
considerations. The result is 8 



where the inner sum is extended over all the elements s of 
■"■(Ai fit ' ‘ ')• We denote the value of the character x for an 
element 5 belonging to the class f of conjugate elements of 
TT f by x(f) ; our formula may then be written 


X 



c hH 


(k) 


m 





(9.4) 


where c flfl . . . (k) is the number of elements of ir(f lt f tl • • •) 

belonging to the class f. This number can be evaluated in an 
elementary manner. 


Distribution of Permutations in Classes. 

Any permutation s is a product of cycles, no two of which 
contain a common numeral. The 5- term cycle (1 3 7 2 4) is 
a permutation which sends 1 into 3, 3 into 7, 7 into 2, 2 into 4, 
and 4 into 1 again ; writing these 5 numerals at equidistant 
intervals on the rim of a wheel, this permutation may be con- 
sidered as the rotation of the wheel about the angle 2n 5. Given 
any permutation, for example 

123456789 

(9.5) 

3 4 7 1 9 8 2 6 5, 

the cycles may be separated out by first determining the number 
(3) into which 1 is transformed, then the number (7) into which 
3 is transformed, etc., until a number is obtained which has 
already appeared in the cycle ; this number can, of course, 
only be 1. After separating out the first cycle the remaining 
numbers can be handled in the same way, and the process may 
be continued until the desired result is obtained. The per- 
mutation (9.5) is, in terms of its 3 cycles, 

(1 3 7 2 4) (5 9) (6 8). (9.6) 

The reduction of an arbitrary permutation into its cycles is 
obviously unique. This way of writing the permutation enables 
us to tell at a glance whether two given permutations are con- 
jugate in 7 T f or not, for an element conjugate to ( 9 . 6 ) is obtained 
by replacing the numbers 1, 2, 3, 4, * by the same numbers 
in any order. The class I to which an element 5 belongs is thus 
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determined entirely by the number of cycles and the number 
of integers they contain ; in particular, any permutation 5 and 
its inverse s~ l belong to the same class. We denote the class 
f whose elements s consist of i x cycles with one numeral, i 2 with 
two, z 3 with three, * • * by (i x i 2 z 3 * • •) and write x(f) = x(h H ***)>* 
naturally 

Hi T* 2i 2 + 3 z 3 + • * • == /. (9.7) 


The number K of classes is the number of solutions of (9.7) with 
non-negative integers i x , i 2 , i z , • • *. 

The number of elements in the class f = * * *) is 


l < it 1 !2 i «t g !3*Uj! • • •. 


To show this we write the / integers 1, 2, • * % / in any of the 
/! possible orders and divide off each of the first i x integers by 
parentheses, then divide off the next 2 i 2 in groups of 2, the next 
3z 3 in groups of 3, • • \ The symbol so obtained is to be inter- 
preted as the expression of permutation in terms of its cycles. 
Each of the/! possible arrangements so obtained leads to a definite 
element s of the class f, and all such elements must be included. 
We must now investigate how often the same s occurs among these 
/!. Now the 5-term cycle (1 3 7 2 4) can also be read as (3 7 2 4 1), 
(7 2 4 1 3), etc. : the particular integer with which we begin is 
immaterial ; such a cycle will occur five times. Hence those 
lh 2 l ® 3 l *a • * • arrangements which differ only by a cyclic per- 
mutation of the numerals in each cycle are all associated with 
the same element s. Furthermore, the i x 1-term cycles may be 
written down in any order, the i 2 2-term ones in any order, etc., 
and these i x \i^ * * * arrangements all lead to the same element s. 
Hence each element occurs exactly l , ‘it 1 !2 , ‘*i 2 ! * * ’ times, and the 
total number of elements in the class is accordingly given by 
(9.8). 

We must also determine the number of elements of f which 
are contained in the sub-group 7r(f h f 2l • • •). For this purpose 
we divide the numbers from 1 to / in sections of lengths f 1} 
/ 2 , • * * and consider only those permutations 5 which permute 
the numbers of the first section among themselves, the numbers 
of the second among themselves, etc. On dividing 5 into cycles 
as in the above some of the cycles will be contained in the first 
section, i.e. will consist only of numerals belonging to the first 
section, some will be contained in the second section, etc., and 
no cycle will consist of numerals belonging to different sections. 
Denoting the number of 1-term cycles contained in the first 
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section by i u , the number of 2-term cycles in this section by 
i ia , etc., whence necessarily 

1*11 + ' 2*12 + 3*13 + * ’ ‘ — fu 


the number of permutations of 1, 2, • • *, / x satisfying this 
requirement is, by (9.8), 


A! . 1 

*u ! *i2 ! * • • l* u 2*>» • • •* 


(9.9) 


Proceeding analogously for the 2 nd , 3 rd , etc., sections, the number 
of permutations in -nQ-Jz • • •) satisfying all our requirements is 
given by the product of all numbers of the form (9.9) for the 
various sections. But such an element is a member of the 
class ! = (i'ii 8 • • •) if and only if 

£*«1 = * 1 ) J ?*“2 ~ * 2 j ‘ ‘ ‘ ! ( 9 . 10 ) 

« a 

hence 



(i) * 


where the sum is extended over the various solutions of equations 
(9.10) and 

i^y = ^ fi) * * *• 

V v 

The inner sum in (9.4) is accordingly 

1 v-'f T"! 6 * 1 ’ 01 } 

iw* . ■ .2All*«ii ‘ *w ’ y 
«) « 

the only restriction on the sum being the conditions (9.10). Let 

a i = s i 4* e 2 4- * * * + e„, 

*2 = 4 + si + • • • + 4 


Our results can be expressed entirely in terms of these sums of 
powers, for by the multinomial theorem 
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where the variables i al ,4 2 , * * *, over which the sum is extended, 
are subject to the restrictions (9.10). We thus finally obtain 
the simple formula 


X(£i, s 2 , 



X(h h ‘ • Qgj • ‘ ’ 
l* 1 2‘ ! • • • i x l i 2 \ • • • 


(9.11) 


We have so far made use only of the elementary connection 
between the groups tt and c. If we now introduce the assumption 
that the number field over which our algebras are defined is 
algebraically closed, and is in particular the continuum of all 
complex numbers, the primitive characters of the finite group tt 
have the orthogonality properties 

Zn(t)x(Mn = h, 

l 

= 0 (x 4= x')‘ 

f 

Furthermore, the number of primitive characters is equal to 
the number K of classes. The above relations assert that the 
matrix of the x(f), where x runs through the entire set of primitive 
characters and f all classes, has as its reciprocal the matrix 

i-nflxOf- 1 ). 

Hence we also have 


smo-') = 

fx(l'))rfn = 0 for V + 1. 


This is, in fact, merely an alternative form of the completeness 
theorem. In dealing with the symmetric permutation group rr f 
! _1 = ! and the order is h = /!. 

On multiplying the expression (9.11) for the primitive 
character X by %{HH ’ * 0 an ^ summing over all the primitive 
characters % of ny, we obtain, with the aid of the relations 
derived above, the important formula 


Oi l 0*2 * * * = Ixihh * * *) X ( e n e 2 , * ’ h £ «) 
x 


(9.12) 


where x and X are the characters of corresponding irreducible 
representations of 7 iy and c n . 
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§ 10. Direct Product. Sub-groups 

Programme . 

If two atoms or ions with f X) f 2 electrons, respectively, come 
together to form a molecule we may to a first approximation 
neglect the interaction between the two atoms so long as the 
distance between them is relatively large. In this approximation 
the two kinds of electrons are dynamically different, for the 
electrons of each atom are influenced only by the nucleus and 
the remaining electrons of the same atom. The symmetry is 
therefore described by the sub-group n of the symmetric group 
7 T = 7t f of / = A + A things in which the first A and the last f 2 
things are permuted among themselves. A similar situation 
arises when three or more atoms come together to form a 
molecule. These considerations immediately suggest the follow- 
ing problems. 

I. The theory developed in §§ 2-4 is to be extended to the 
case in which the symmetric permutation group is replaced 
by any permutation group 7 t. Naturally the definition of a 
symmetric transformation in tensor space is to be adapted to 
the new situation : we require only that the coefficients 
a(i x •••»/; &i • ' • k f ) of (1.2) remain unchanged under an 
arbitrary permutation belonging to the group i r of the sub-indices 
1, 2, • • *, /. We say that these transformations are symmetric 
with respect to it' ; they constitute an algebra Z' which is 
obviously more extensive than Z . — This question is immediately 
settled by the remark that all our previous deductions are valid 
for an arbitrary permutation group n . Here it' is considered as 
an independent group rather than as a sub-group of the sym- 
metric group. 

II. Let the set of integers from 1 to / be divided into two 
or more sub-sets. We consider, as an example, the case of 
two sub-sets : the “ red ” numerals from 1 to f x and the “ green n 
ones from 1 to/ 2 ; A + A = /• Let tt' consist of all permutations 
of the red among themselves and the green among themselves. 
Hence a permutation s' = (s x , s 2 ) of n consists of a permutation 
s x of the A red numerals and a permutation s z of the green ones ; 
it' is the direct product tt x x rr 2 of the symmetric group ir x of f x 
and tt % of A things. Or conversely, this direct product — the 
abstract definition of which has nothing to do with the group 
of permutations of / things — may be considered as a sub-group 
7r' of the symmetric group of / = A + A things on arranging 
the sets of numerals, on which permutations of 7 t X} 7 t 2 act, one 
after the other to form a single set. But here we are interested 
in the following problem (which can be proposed for arbitrary 
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finite groups) : to discuss the properties of a group t r x x 7r 2 
w k*ch is the direct product of two finite groups 7r Xj 7 r 2 . 

Ill- In order to discuss the structure of molecules we must 
eventually take into account the interaction between the various 
atoms or ions contained in the molecule. This means that we 
must finally return from the sub-group 7r' to the full symmetric 
group tt, so we must examine the relations existing between the 
group tt and its sub-group 7 t'. Here again the problem is not 
restricted to permutation groups. 

Direct Product. 

Let 7r Xj 77 2 be two finite groups of orders f x , / 2 respectively. 
The elements of the direct product tt = tt x x 7t 2 are the pairs 
s z) consisting of an element s x of 7 r x and an element s 2 of 
7 t 2 . An element of the algebra of 7 r is accordingly a function 
^2), and it follows from this that the algebra of tt is the 
product of the algebras (7^) and (t7 2 ) : 

(tt) = (7 T X ) x (t7 2 ) 

in the sense of the X -multiplication of vector spaces introduced 
in II, § 10 . An element x 1 : x^Sj) of (77^ and an element x 2 : 
# 2 (^ 2 ) (^2) yield the element x = x x X x 2 ol (tt), whose com- 

ponents are given by 

^2) “ ^1(^1) * #2(^2) • 

Indeed, given any two algebras p x , p 2 , their direct product 
P = px X p 2 can be constructed and multiplication in p defined by 

(a x X a 2 )(b x X b 2 ) = (a l b 1 x a 2 b 2 ) 

whether they are group algebras or not. 

If p is a linear sub-space of r a = p* (a = 1 , 2 ), an element 
x : x(s lf s 2 ) of (tt) is in p = hi X p 2 if and only if it belongs to 
when considered as a function of s 1} holding s 2 fixed, and to 
p 2 when s x is held fixed ; indeed, any element of this kind can 
be expressed as a linear combination of products of the form 
a i X a 2 , where a x is in p x and a 2 in p 2 . If $)*(a — 1 , 2 ) is an 
invariant sub-space of x*, generated by the idempotent element 
e a and the representation space of the representation f)* of p M 
induced in by the regular representation, then p is also 
invariant, has as generating idempotent element e = e x x e 2 
and is the substratum of the representation f) x X f) 2 of p. It is 
evident that the equivalences p x ~ p' 1} p 2 ~ p 2 imply the equi- 
valence px X p 2 ~ pi X p2 • 

Suppose the two p a considered above are also irreducible 
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with respect to their algebras p a ; the question then arises as 
to whether hi X p 2 * s irreducible (with respect to p) and whether 
p — hi X p 2 is equivalent to p f = hi x p' 2 {p* irreducible) only 
if Pi/^ Pi, P 2 P 2 * P and p ' are inequivalent if exe' = 0 
identically in x , i.e. if the sub-space consisting of elements of 
character (*, e') contains only the element 0 ; here e = e x X e 2) 
e f = e[ X 4* Now the formula 

(*1 X e 2 )(xj X x 2 )(e[ X 4 ) = *i %4 X *2*2*2 

shows immediately that the sub-space ( e , £') is the direct product 
of the two sub-spaces (e X) e[) and (e 2 , 4), and can consist merely 
of 0 only if one of these two sub-spaces consists merely of 0, 
i.e. only if hi is inequivalent to 4 or p 2 is inequivalent to £ 2 * 
Our second question is thus answered in the affirmative — regard- 
less of the nature of the field over which the algebras are defined. 

The first question is answered in the affirmative in III, § 9, 
for the only case of physical interest, i.e. that in which the field 
is algebraically closed. If we are more interested in the re- 
duction of the algebra than in the representations we can argue 
as follows. The algebra of elements of character ( e , e) is the 
direct product of the field [division algebra) of elements of 
character [e lf e t ) in p x and the field of character (e 2 , e 2 ) in p 2 * 
Assuming the original field is algebraically closed, all elements 
of are multiples of e„ and consequently all elements of p 
with character [e ) e) are multiples of e . This proves the irre- 
ducibility of hi X ha* If, however, the original field over which 
the algebras are defined is not algebraically closed our assertion 
is correct only if the direct product X # 2 of the two fields 
is again a field, and this is by no means always the case. But 
in any case the question concerning the nature of the direct 
product of algebras is, as in the question concerning the structure 
of an algebra in § 7, reduced to the analogous problem for fields 
(division algebras). 

Again taking the fundamental field to be the continuum of 
all complex numbers, the complete reduction 

h = M k) 

< h 

into irreducible invariant sub-spaces h* has as a consequence, 
in accordance with the above, the reduction of x = r, x X 2 into 
invariant irreducible sub-spaces hi 0 X h^- 

Sub-groups . 

, f sub 'S rou P of the given finite group n. An element 

x of the algebra r — p' = («■') of it’ consists of components x'(s') 
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associated with the various elements s' of it' . However, such 
an element can, and in the following will, at the same time be 
considered as an element of the algebra p = (tt) ; we need only 
to define the components x'(s) associated with elements 5 of tt 
which are not contained in tt' as zero. This disturbs in no way 
the addition and multiplication of elements of (tt') with each 
other or with arbitrary numbers of the field. ,An element x of 
(tt) 44 belongs ” to tt' or 44 lies ” in (tt') if and only if all com- 
ponents x(s) associated with elements 5 of the group that are 
not in tt vanish. 

An irreducible invariant sub-space p' of x' is generated by a 
primitive idempotent element e' and is the substratum of a 
representation f)' of tt' induced in p' by the regular representation. 
On reducing the modulus 1 of tt' into independent primitive 
idempotent elements 

1 = !>; + •■• ( 10 . 1 ) 

i = l 

a certain number, say g', of elements e\ will appear which are 
equivalent to e ' ; the sub-spaces p\ which they generate are all 
equivalent to p' and the regular representation of tt' contains f)' 
g' times. Equivalent summands are added together into 
such partial sums. Considered as an element of the total 
algebra p = (tt) e' is, however, in general reducible into inde- 
pendent primitive idempotent elements : 

*’ = Z** + • * • ( 10 - 2 ) 

a — 1 

Here again equivalent summands on the right are collected 
together into partial sums ; let the in the first such partial 
sum generate the representation f) of 7r — we shall in the following 
be interested only in these. Let the sub-space p with the 
generating unit e be a representative of the sub-spaces p a gener- 
ated by the e a . The elements of (tt) of the form xe' constitute 
an invariant sub-space <£'> which is the substratum of a re- 
presentation <f)'> of tt induced in p' by the regular representation 
of tt. Our formula asserts that cm reducing <f)'> into its irre- 
ducible constituents f) occurs exactly b times. 

In order to obtain a simple characterization of the elements 
of ip'} we divide the elements of the group tt into sets of group 
elements which are equivalent mod. tt' ; the u th such class 
consists of the group elements o u s', where s' runs through the 
sub-group tt'. An element x of the algebra ( 77 ) has as components 
x(a u s') ; the numbers x(a u s') may, for fixed u ) be considered as 
the components of an element x' u of the algebra ( 7 r'), so that x 
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may be considered as the set of elements x' u belonging to the 
algebra (tt'). The formula y = xe f then becomes y' u = x' u e' in 
(tt ; ) : hence x belongs to <p'> if and only if all the partial 
elements x' u lie in The correspondence 

x — > y = ax 

may then be written 

y(<? u s') = Z Z a(c u s't'- 1 or~ 1 )x(<j v t') 

v f'inn' 

or 

y» = Za’uX 

V 


where a’ uv is the element of the algebra (tt') defined by 

a uv (s’) = aXs’vv 1 )- 

The representation <!)'> may therefore be constructed as follows : 
first associate with the element a of (n) the matrix ||a(J|, the 
coefficients of which are elements of the algebra (tt') instead of 
numbers, and then replace each a‘ uv by the matrix A' uv associated 
with it in the representation f>' of tt'. 

As we have seen in the earlier part of the present chapter, 
the representations are obtained with the aid of a double Peirce 
decomposition ; we therefore consider the elements x — e'xe' of 
character («', e'). The idempotent elements e a , • ■ • appearing 
in (10.2) are of this character, and such an element x may be 
expressed in terms of its components 

» 

x = Z e a xe ? + • • •• (10.3) 

We now repeat the analysis of § 7 for our more restricted set 
of elements: let be a one-to-one similarity correspondence 
of p x on p and let the element into which e x is sent by the corre- 
spondence 1 be denoted by e x p*. If, as we now assume, 
the field over which the algebras are defined is algebraically 
closed e^e p is necessarily a multiple x a , of e a0 . We then obtain 
instead of (10.3) the reduction 

* = «*/>+•• •, (10.4) 

(where the x a0 are numbers) and the representations 


'•* (10.4') 

38 in S 7. but in contrast -with our usual notation, the product of 
two or more correspondences r is to be read from left to right. V 
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Now if in particular x is in (n) then # = e'xe' is a numerical 
multiple of (10.2), and the matrix \\x a ^ associated with such an 
element is a multiple of the unit matrix. — The degree of the 
secular equation, the solutions of which determine the character- 
istic numbers, is thus decreased from g to b for an element x 
of character (e\ e '). We now proceed to examine the cause of 
this. 

Let r:- 1 be a one-to-one similar correspondence of p' on p'. 
(i — 1, 2, • • •, g'), and let the element into which it sends e' 
be On considering an arbitrary element x of the algebra 
of 7 r as the set x u) we see that the correspondence 

xe ’ -> xb[ 

is a one-to-one reciprocal and similar mapping of <p'> on <p/> .* 
the projection F[ of p'. on p gives rise to such a projection of 
<p/> on <p'>. This projection associates with the reduction 
of <p'> into irreducible invariant sub-spaces a reduction of the 
same kind of the sub-space <p t '> ; corresponding to equation 
(10.2) we obtain the equations 

e\ = Ze ai + • • •. (10*5) 

<%■=* i 


On combining (10.1) and (10.5) we obtain a reduction of the 
modulus 1 into independent primitive idempotent elements of 
(tt). Now consider the partial sums £e\ of 1 and their reductions 

X 

(10.5) as written one above the other. Each row is then as- 
sociated with a definite representation f )' of n and each column 
on the right-hand side, the terms of which are sums of the form 
ZZ e » £, is associated with a definite representation f) of 7r. We 

i » 

now collect together all the summands ej occurring in the first 
column on the right, i.e. all those elements ej which are equivalent 
to e. The set of indices J is then broken up into sub-sets, each 
of which is associated with one of the inequivalent irreducible 
representations !)',••• of it ; the first of these sub-sets, which is 
associated with f)', consists of the bg' double indices az. 

Let the similarity projection of p[ on p ' h send e\ 

into e i; t If x f is an element of (■ n ) the equation 


X* = Z e i % e k “f* ’ * • 

if k 

yields the reduction 

x'~Zx*e i;k +> * • 


( 1 0 . 6 ) 
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with numerical coefficients x' lK , and %' -> 114*11 is the representa- 
tion I)'. (The partial sums should preferably be written one 
above the other rather than horizontally.) f/ may be con- 
sidered as a similarity transformation of <p(> on <p'> and 
therefore contains a transformation of the same type of on 
T'iT a then provides us with a similarity correspondence 
of on p. Let Tj be a fixed one-to-one similarity correspond- 
ence of pj on p and let the similarity correspondence JyT* 1 0 f 
pj on p K send ej into ej- K - We may take the correspondence 
r;r„ as Tj for the index J = at, and similarly for the remaining 
sub-sets. On applying the correspondence r\r ' k ~ l = 
to equation (10.5) we find 

4 * = £■**<;«*+• • •• ( 10 - 7 ) 

C*“l 

The equation 

x = £ejxe K -)-••• = £xjk e J; K + * • • (10.8) 

J,K J, K 

then determines the representations 

f) : x-+ lltyxll ; • * •• 

By (10.6) and (10.7) the matrix associated with an element % 
of (t O is 

X oA; ph == X JK = 0 

where the two indices J and K belong to different sub-sets. 
But this means that on restricting v to n the representation f) 
is reducible into the irreducible representations fy, • * * of tt, 
I)' appearing exactly b times. We have thus obtained a con- 
structive proof of the theorem 9 : 

First Reciprocity Theorem {for arbitrary groups). If <1}'> 
contains the representation f) of tt exactly b times , then on restrict- 
ing the group n to tt\ f ) contains the representation f)' of tt' exactly 
b times . 

If the sub-group tt' consists merely of the unit element 1 
this theorem reduces to our previous result : the number of 
times an irreducible representation appears in the regular 
representation is equal to its dimensionality. Both the com- 
plete theorem and this special case depend on the assumption 
that the field over which the algebra is defined is algebraically 
closed. 

Connection with Symmetry Classes of Tensors . 

We apply the results of our investigation III to the symmetric 
group 7 t and make use of the correlation described in I above for 
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it as well as for its sub-group tt\ An irreducible sub-space p 
of ( 77 ) determines a symmetry class of tensors ; let the 

corresponding representations of tt and the linear group C be 
f) and §, respectively. An irreducible invariant sub-space p f of 
(7 t) determines a symmetry class ^5' of tensors which is invariant 
with respect to the more extensive algebra Z r of all transforma- 
tions which are symmetric with respect to 77 ' ; as such is 
irreducible. If e' is the generating unit of $)', 9(3' consists of all 
tensors of the form &F ; but this is equivalent to saying that 
the symmetry element F of (tt) belongs to (p'y. Hence the 
reduction of $}$' into irreducible invariant sub-spaces with respect 
to the more restricted algebra 2 parallels the reduction of <p'>. 
Let f)' be that representation of 77 ' induced in p' by the regular 
representation of 77' and §' that representation of c whose sub- 
stratum consists of all tensors in the symmetry class Hence 
our general theorem — or rather its converse, the truth of which 
follows immediately from the theorem itself — allows us to state 
the 

Second Reciprocity Theorem ( applicable only to permutation 
groups). If the irreducible representation f) of tt contains the 
irreducible representation f)' of tt' exactly b times when considered 
as a representation of the sub-group 77 ', then conversely the repre- 
sentation of C contains the representation § exactly b times. 

Finally we take tt' as 77 x X 7 t 2 as in step II above, p' can 
then always be taken in the form p x X p 2 , and the irreducible 
invariant sub-space p M of (nf) determines a symmetry class ^ 
of tensors of order f a (a == 1, 2). Denote the corresponding 
representations of 7 r a and c by f) a and The associated 
with p' = p x X p 2 consists of all tensors of order / = A + / 2 
which satisfy the symmetry conditions of 9£i with respect to 
their first f x indices and the symmetry conditions of $ 2 with 
respect to the last / 2 ; i.e. X $ 2 - Our theorem now 

becomes : 

Third Reciprocity Theorem (for permutation groups). If the 
irreducible representation f) of tt contains , on restricting tt to the 
sub-group n — tt x X tt 2 , the representation t> a X f) 2 °f ^ exactly 
b times (f)* an irreducible representation of nf), then conversely the 
representation X $ 2 °f c contains the representation § exactly b 
times. 

§ 11. Perturbation Theory for the Construction of 

Molecules 

We return to the investigation of the physical system V 
consisting of / electrons or equivalent individuals I. As long 
as we disregard the interaction between the individuals we obtain, 
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among others, /I-fold energy levels E of the type (8.4). We 
consider in particular the case in which the are different 
simple levels of the individual /. In order to follow the resolu- 
tion of E ) due to the mutual interactions of the electrons, to 
the approximation which characterizes the perturbation theory, 
we must first determine the elements a of the algebra of tt, the 
components a(s) of which are the exchange energies, and trans- 
form the matrices corresponding to a in the various irreducible 
representations of into diagonal form by an appropriate 
change of co-ordinates (§ 8). We now assume that the most 
important of the exchange energies a(s) are those belonging to 
the permutations s of a certain sub-group rr of n ; all others 
shall be small in comparison with them (“ quantities of 2 nd 
order ”). Our procedure is divided into two steps, corresponding 
to the investigation of sub-groups carried out in the preceding 
section. Let a' denote that element of the algebra (tt') which is 
defined by 

a'(s) = a(^) or 0 

according as s is an element of the sub-group tt or not, and let 
the matrices associated, with a! in the irreducible representations 
I )' of tt be referred to principal axes ; then 

e\a!e k = 0 (i 4= k), e^a'el = W { • e[. 

The characteristic numbers W { are the energy levels on neglecting 
perturbations of 2 nd order; we assume they are all different. 
In order to examine the further resolution of such a term 
W = Wi under the influence of the 2 nd order perturbation we 
need, in accordance with the perturbation theory, to consider 
only that part 

a* = e'ae' 

of a which is of character (e', e'), where we have written e ' in 
place of e\. This term yields b terms W a belonging to the 
symmetry class x associated with the irreducible representation 
lj of tt, the values of which are the characteristic numbers of 
the matrix \K$ associated with the element a* = e'ae' as in 
(10.4'). All the algebraic elements appearing in these con- 
siderations are real and the corresponding matrices are con- 
sequently Hermitian. 

We apply the procedure to the process by which molecules 
are constructed from their constituent atoms. 10 We consider 
as an example two atoms joining to form a molecule, the one 
containing and the other f z electrons ; / = f x + / 2 . We 
consider the two nuclei as held fixed at a distance d apart, which 
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is large compared with the linear dimensions of the atoms, and 
attempt to determine their interaction energy as a function of d. 
The sub-group 7 / = tt x X 7 t 2 consists of all permutations which 
send no electron of one atom over into the other ; we have seen 
in § 10 that we may then take the primitive idempotent elements 
e[ == of the algebra (77-') in the form e Y X e 2 , where e 1} e 2 are 
in (-tti), (??%) respectively. On neglecting the interaction between 
the electrons of the one and the electrons of the other atom we 
obtain an energy term W which belongs to definite symmetry 
states of both atoms. e f generates a sub-space ^ x 

(of the tensor space which is invariant under all symmetric 
transformations ; that the state of the molecule is described 
by a tensor of this sub-space means that the state of the first 
atom is in and that of the second in Hence on reducing 
^5' in parallel with the reduction of < p'y into irreducible in- 
variant sub-spaces : 

e' = + • ‘ •, <*>'> = + •••,$' = + • • •, 

a > <x a 

there occur b sub-spaces which are equivalent to one another 
and which belong to a certain representation of tt or to a certain 
symmetry class of terms of the total system. The procedure 
sketched in the preceding paragraph thus leads to b terms which 
(1) arise, due to the perturbation, from the given unperturbed 
term (8,4) and (2) which belong to certain given symmetry 
states Xii Xt and X °f the two atoms and the molecule. This 
reduction of the total system space ffi into sub-spaces, each of 
which corresponds to a definite symmetry state of each of the 
atoms taken separately and of the molecule, naturally is not 
bound up with the approximate calculation of levels with the 
aid of perturbation theory ; the connection between the two 
appears only on taking the above condition (1) into account — 
the very essence of which implies the assumption of small per- 
turbations. This somewhat sketchy account of the situation 
arising from an unperturbed term of the type (8.4), in which 
the energies E { of the individual I are non- degenerate, can readily 
be extended to cover other more complicated types of unper- 
turbed terms. These other cases are of course of much greater 
physical interest, for we have seen in Chapter IV that all atomic 
energy levels, except 5- terms, are necessarily degenerate. 11 

The fact that the total system may be in any one of several 
symmetry states ?$, corresponding to different energy levels 
(i.e. binding energies), when the symmetry states of the com- 
ponent atoms are given is of greatest importance. We shall 
later show that these possibilities, finite in number, coincide with 
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those predicted by the empirical theory of the valence bond, and 
that consequently the symmetry state of an atom is that which 
chemists call its valence state. The situation thus arising cannot 
be described adequately in terms of classical models — e.g. the 
fact that the two H atoms constituting an H 2 molecule can be- 
have in such a way that the state of the molecule may lie in 
either the space of symmetric or anti-symmetric tensors of 
order 2 ; only the first case can lead to an attraction which will 
bind the atoms together — the second always results in a re- 
pulsion. 12 The binding energy between two ions of total residual 
charges e l7 e 2 is naturally due mainly to the Coulomb potential 
e j0 2 /d (“ ionic binding ” or 14 polar bond ”), but the corresponding 
energy for two neutral atoms is due for the most part to the 
interaction of the “ exchange energies ” a(s ) of the electrons of 
the two atoms (“ atomic binding ” or “ non-polar bond ”). 
This quantum-mechanical solution of the puzzle offered by the 
non-polar valence bond was first given by F. London and 
W> Heitler. 

The following points are to be taken into consideration in 
applying the theory of perturbations to the actual evaluations. 
On neglecting the interaction between the various electrons 
each is subject only to the attraction of the two nuclei ; we 
should therefore perhaps begin with the characteristic numbers 
E { and the corresponding characteristic functions $ i(xyz ) of 
this one-electron problem. The first approximation should then 
be obtained by taking into account the repulsions between the 
electrons of each of the atoms separately, thus introducing a 
dynamical difference between the two kinds of electrons. This 
procedure is naturally significant only so long as the distance d 
between the atoms is large in comparison with their linear 
dimensions a . But then it is also reasonable to take as our 
0 th approximation that in which each of the electrons is subject 
only to the attraction of its own nucleus (plus the closed shell 
of electrons which are not to be taken into explicit account in 
the calculations). Let this one- electron problem for the first 
atom have the characteristic values E t and characteristic func- 
tions and let the corresponding quantities for the second 
atom be Ey, The fact that the i/r* and the i/t? together 
cannot constitute an orthogonal system — indeed, they are not 
even linearly independent, for the ipi alone constitute a complete 
orthogonal system — causes some difficulty. But if we break off 
the series of quantum states at a finite n — which can be chosen 
higher the larger the value of d\a under consideration — the 
finite set 
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,of functions 0 constitute an almost orthogonal system ; the 
fundamental metric form G 0 , the coefficients of which are the 
scalar products 

gik = (<Pi, fa) = \fafadV 


(where i and k run through the primed as well as the un-primed 
indices), differs but little from the unit form. Indeed, an integral 
of the form (*/r 1} </v) is of order of magnitude e~ d K To show 
this we note that if the two centres of force are nuclei or closed 
cores "with “ unit ” residual charge, the normal states of the 
atoms are given by 


^1 


V' 


-r/a 


7ra° 


IT Q ? 


p-r'ja 


where r and r’ are the distances to the two cores. The integrand 
in 

\ e -< r + f)i«dV 


= \ e 


is everywhere ^ e~ d/a . This integral can readily be exactly 
evaluated on introducing bi-polar co-ordinates (r, r\ <f>) ; the 
volume element is then 

9 

dV =s -j-rr* dr dr' 
d 

and the range of integration is defined by 

v + r' *> d, — d <Lr — r' <>d. 

On introducing 


r -j- r' r — r f 

1 == P > 1 


= A 


we obtain 


00 +1 


= f(p*-p'*)r-»dp'dp 

i - 1 
oo 

-jV- 


2 


For the /-electron problem we therefore start with the 
functions 

fah, •••,*<■) — n <i’i(xyz) 
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as approximations to the characteristic functions ; in this 
product the co-ordinates are those of the / electrons and i runs 
through the values i X) i 2 , - * *, each of which is one of the primed 
or un-primed indices between 1' and n' or 1 and n . The funda- 
mental metric form G = G 0 X Gq X • • • X G 0 has as components 
the scalar products of «/f(i 1} i 2 * * * h) with ^(&i> &2 * ' * ^/) an ^ 
the components of the energy H , the potential part of which is 
obtained by adding together the potential energies resulting 
from the attractions and repulsions of the various electrons and 
the two cores, are the scalar products of • * * if) with the 
vector • • • k f ) into which • • - k f ) is sent by the 

operator H. We consider the resolution of the unperturbed 
term 

£ = (£i + • • • + E fl ) + (Ey + • ■ ■ + £/;). 

The components 

G{h • • • if ; k x • • ' k f ) and H(i x • • • i f ; • • • £/), (1L1) 

in which the indices i, k are permutations £, respectively, of 
1, • - f 1} I', • ’• •, / 2 ', are of the form G(st~ l ) and H(st~ l ). Wc 

introduce the (real) elements Q and ft with components G(s) 
and H(s). Q and H are next replaced by O' and ft' with com- 
ponents G(s) and H(s) if s is in it' — tt x x tt 2 , and 0 otherwise ; 
the justification for this lies in the fact that the components 
associated with an s which is not in tt' are very small — they are 
of relative order e^ a . O' is in fact the modulus, whereas O 
is not ; the procedure employed previously must therefore be 
modified in the following purely formal respect. On repeating 
the reasoning, keeping in mind the fact that O is no longer the 
modulus, we find as the secular equation for the determination 
of the b terms A = W a 

| | = 0 , ( 11 . 2 ) 

in which 

e'Oe 1 = £G a p e a/J + • • •, 
e'He’ = £H^0 al) + - • • 

“P 

in terms of the notation employed in the preceding section. 

This procedure is open to the criticism that whereas the 
second order perturbations between the electrons of the same 
atom are neglected, the interaction between the two atoms, which 
is considered to be of second order, is taken into account. The 
results are therefore inapplicable to the limit d/a -*■ co and can 
at most be applied successfully in cases in which d/a is consider- 
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ably larger than 1 but not too large. On the other hand, we 
could begin by assuming that the solution of the quantum 
problem for the individual atoms is already known. Let the 
function ip x of the co-ordinates of the first f x electrons be a 
characteristic function of the first atom corresponding to the 
energy term E x (so normalized that the integral of $-£p x is unity) ; 
it will belong to a certain simple symmetry state of the first 
atom, i.e. there exists a certain real primitive idempotent element 
e l of ( 7 r x ) such that e x tp x = t p x . Similarly, let ip 2 be a character- 
istic function of the second atom for the term E 2l having a 
corresponding property 8 2 t p 2 = ip 2 . Neglecting the interaction 
between the atoms, ip = ip x . ip 2 is a characteristic function of 
the molecule consisting of the two atoms and having the energy 
E — E x J r E 2 . e' = e x X e 2 is a primitive idempotent element 
of the algebra of ir 9 = rr x X 7 r 2 and i/j has the property 

av = ^ 

The functions sift, which are obtained from p by the totality of 
/! permutations s of its arguments, span a linear function space 
(SR) of a finite number of dimensions — in which the sifj are natur- 
ally neither linearly independent nor mutually orthogonal. 
The theory of perturbations requires us to find those functions 
cf) of (SR) which are such that the orthogonal projection of H(j> 
on (SR) is proportional to 6 itself ; the factors of proportionality 
are then the values of the displaced terms, to a first approxima- 
tion. We must therefore evaluate the integrals G(s, t), H(s } t) of 

tip-sp and tf • H{siff) 
and solve the secular equation 

|A G(s, t)-H{s,t) | = 0. 

G and H depend only on t“ x s :* 

G(s, t) = G(r%), H(s 9 t) == H(r l $). 

This is proved by the fact that the integral of tfi * <f> is unchanged 
on replacing tp, p by rip, rep ( r an arbitrary permutation) ; H(sp) 
is equal to sHip because of the symmetry of the operator H. 
Let G and H again be the elements of (tt) with components 
G(s)j H($), They satisfy the equations 

e'Ge' = O, e'He f = H 

* On comparing this with (11.1) it is to be remembered that there the 
permutations 5 and t operate on the indices and not on the arguments ; hence 
the elements (11.1) are. in our present notation, 

G(t~\ 5 - 1 ) and N(t-\ *-*). 
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and are therefore of character ( e ', e'). Indeed, we have, for 
example, 

if/ = Ze'fr- 1 ) • n/>, whence ' H(stp) = Ze'ir” 1 ) • H(snf>), 

r r 

and on multiplying this latter by $ and integrating we find 
H(s) = Ze'{r~ l )H(sr) or li = He'. 

r 

It then follows that also If = 57/ whence, since e f is real, 
H = e r H and consequently H = e'He' as asserted. 

The only non-vanishing elements of the matrix ||#j;c||, 
which corresponds to the element If in the representation f), 
are (in the notation of § 10 with e\ = e') those contained in the 
square sub-matrix of length b in which the row and column 
indices J and K are of the form al. We are thus led directly 
to the secular equation 

| — H a p | = 0 


of b th degree. (The most natural method of solving this equation 
consists in finding that linear transformation which sends the 
Hermitian form with coefficients G a p into the unit form and at 
the same time reduces ||F a ^|| to diagonal form.) SH K<t is then 

a 

the trace of the matrix belonging to H in the representation f), 
or 

EH xa = 2H{s)x(s). 

ot 8 


If in particular 6 = 1 the above symmetry system of the 
molecule contains but a single term arising from the unperturbed 
term E ; its value is, in accordance with the equation derived 
above, given by 


ms)x(s) _E + I?H(s)xls) 

ZG(s)x(s) ' 1 + Z'G(s) X (s) * 


(11.3) 


The accent on the right-hand side indicates that these sums arc 
to be extended over only those permutations s which do not 
belong to v . This formula (11,3) is due to F. London. 1 * It 
will be shown later that in the case of diatomic molecules b 
is always 1 ; we must expect, however, to find higher values of 
b in dealing with more complex molecules. The real difficulty 
from the physical standpoint naturally consists in getting in- 
formation concerning the exchange energies H(s). It is to be 
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noted, however, that we need only to concern ourselves with the 
sums 

EH{rsr% .ZG{rsr-i) 

r r 

over the various classes, for since xi s ) 1S a class function all 
summands in (11.3) for elements in the same class I may be 
added together to give the above coefficients multiplied by x(f). 

Without doubt these investigations, which are as yet in their 
infancy, are of fundamental importance for theoretical chemistry ; 
the non-polar bond is due to the exchange energies. Heisenberg 
has given an explanation of ferro-magnetism with the aid of 
these same principles. 14 

§ 12. The Symmetry Problem of Quantum Theory 

On taking the spin into account the components of a vector 
x(vi), which represents the state of a single electron, has two 
indices t and i ; the first of these refers to the spin and runs from 
1 to v ) while the second refers to the translation and runs from 
1 to n. Actually v = 2 and n = oo (as long as we do not restrict 
ourselves to the consideration of quantum states with fixed 
energy). Our vector space 91 is accordingly % n = 9t v x 9l n . 
The state of a system consisting of / electrons is now to be 
represented by a tensor of order / in this space : F{i x i X) 

* • •> VV) — a “ double tensor ” which stands, so to speak, with 
one foot (the Greek indices) in the space 91„ and the other (the 
Latin indices) in 9t n - This tensor space is completely reducible, 
with respect to the algebra E vn of all symmetric transformations 
of the index pairs (a), into irreducible invariant sub-spaces, 
each of which is generated by '.n idempotent symmetry operator. 
The Pauli exclusion principle states that only one of these sub- 
spaces % n is physically realized ; it automatically abolishes the 
physically absurd existence of multiplicities which cannot be 
resolved and at the same time denies the existence of absolutely 
non-combining systems of terms. Furthermore, according to 
Pauli this ^„ n is the space {W vn } of all anti-symmetric double 
tensors. 

On ignoring the spin perturbation, is to be reduced as far 
as possible into sub-spaces ^ which are invariant with respect 
to the special symmetric transformations of the form 

F'{<- iH • ■ • hh) = Zc{i x • • -i f - fe, • • • k f ) • ■ ■ ■ i,k,) (12.1) 

W 

which do not depend on the Greek indices at all ; these constitute 
our old algebra 2 = This transition from E vn to E n is to 
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be accomplished in. two steps. We first ignore the interaction 
between spin and translation, but allow the translations to 
interact among themselves in an arbitrary manner and similarly 
the spins among themselves ; wc must then consider only the 
symmetric transformations of the form 

y(*x # • * ; *1 • • ■ Kf) ■ c[i x • • • if ; *x • • • */). (12.2) 

These transformations do not constitute an algebra themselves, 
but they belong to their “ enveloping ” algebra E v x 2J n which 
consists of all transformations whose coefficients 

* • - i/v ; K i k i * ■ • 

are unaltered on subjecting the two rows 
of Greek indices to the same arbitrary permutation cr and the 
two rows of Latin indices to the same arbitrary permutation s. 
The second step then consists in letting y in (12.2) be the identity. 
The first step thus consists merely in making the permutation 
of the Greek indices independent of the permutation of the Latin 
indices, and the second in restricting the first of these permuta- 
tions to the identity. 

In the first place, then, \ve introduce the elementary sym- 
metry operator a X s which, on applying it to the double tensor 
jF(i x ii • • • ifif), subjects the Greek indices to the permutation 
a and the Latin to the permutation s. The general symmetry 
operator is then an arbitrary linear combination 

a = 2?a(c7, s)(c r X s) 

<r, a 

of these elementary ones ; we have thus to deal with the algebra 
p X p of elements x, the components x(a, s) of which are functions 
both of whose arguments run through the elements of the group tt. 
We denote the element with components F(a , s) — (a X s)F 
by F ; the equation F' = aF (F' the double tensor obtained 
from F by the operator a) is equivalent to F' = F •&. The 
group it X it of elements cr X s contains tt itself as the sub-group 
consisting of elements s X s. So far as the first step is con- 
cerned, our problem amounts to the following : Let l(s) be the 
components of a primitive idempotent element of the algebra 
t = />=(«■); we set 

l = Il(s)(s X s) 

t 

and study the elements of the form xl in p X p. They con- 
stitute an invariant sub-space (t x t)j which is to be reduced 
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into its irreducible invariant constituents ; in Pauli’s case we 
have in particular 

\Z8,(sXs). 

The procedure which it seems natural to follow is first of 
all to express the modulus 1 of p in any two ways as the sum of 
primitive independent idempotent elements : 

1 = Ze'i, 1 = 2>,. (12.3) 

* 3 

An arbitrary element x of the algebra of p X p is reduced into 
independent constituents in accordance with the equation 

x = Zx(e’i X e,) = Zx tj . (12.4) 

Now we know from § 10, II, that the elements of the form x is 
constitute an irreducible invariant sub-space ; consider 

xl = £x iS l 

h j 

in this light. The projection x y = xl sends over into 
a certain invariant sub-space (p t7 ) of (t X r) z . Since those 
# of pn for which xl = 0 constitute an invariant sub-space of 
we have only the two typical possibilities : either (p tj ) = 0 
or this projection x -*■ xl maps in a one-to-one and similar 
manner on (p^). The sum 

(t X l)i = Zipu), (12.5) 

arranged in some particular order, is such that each term can, 
in virtue of its irreducibility, only either be contained in the 
sum of the preceding terms or be independent of this sum. On 
retaining only those terms arising from this second possibility, 
(x X x)i is completely reduced into the sum of certain of the 
(p^) ; the representation induced in (r X r); by the regular 
representation of the group it X tt is correspondingly reduced 
into its irreducible constituents of the form f )' X f). It will be 
remembered that this symbol stands for the correspondence 

(<r, s) -> 0» X 17(5), (12.6) 

where f)', f) are the irreducible representations or -» U'(cr), 
s~+U(s) of 7 T, This representation f)' X f) appears with a 
certain multiplicity b(x , x) which is determined by the number 
of pairs ij in (12.5) whose e( generate the representation f)' 
and whose e s generate f). These considerations are of course 
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merely a repetition for the case at hand of the proof of theorem 

( 6 . 1 ). 

We now return to the space of double tensors and consider 

A 

the sub-space fi defined by those of the form IF. It is the 
substratum of a certain representation Q(Z V X Z n ) of Z v X S ny 
and its complete reduction is given by the formula 

£(2V x Zn) = Z Hx'> x)(& x §«). (12.7) 

X'> K 

This remains correct even if v or n is less than /. Earlier in 
this chapter we introduced the right- and left-invariant sub- 
space t 0 of t as that sub-space consisting of all elements F which 
correspond to tensors F in the n- dimensional vector space $ft n . 
On denoting this r 0f which depends on n (and only for n ^ f 

coincides with the entire t), by t we should consider the algebra 

v n v n 

tXt instead of t X t. But if e\ is in r and e t in t, the manifold 

V ft 

of elements x(e' i X e t ) is not decreased on restricting x to X X t, 
and every e\ (e t ) which is equivalent to such an e\ (e,) also 

v n 

belongs to t (t). This shows that (12.7) remains correct under 

v n 

this restriction to r X l ; the only effect is that those terms for 
which X is the 0-dimensional representation are illusory 
We are now ready to take the second step : to perform the 
transition from the algebra Z v X Z n to S = Z n by taking y in 
(12.2) as the identity. We then see immediately that the 
representation £(£) of 2*, whose substratum consists of the 
double tensors of S in the sense of equation (12.1), is completely 
reduced into its irreducible constituents §, corresponding to 
the various primitive characters x of 7r ; in accordance with the 
equation 

2(2) = 2m(x) ■ §. 

X 

The multiplicity w(x) with which this representation § occurs 
is given by 

m( X ) = 2b( X ', x)N,(x'), ( 12 . 8 ) 

x' 

where N n (x) is the dimensionality of the representation § n , 
and the sum is extended over all the primitive characters x* 
of 7T, Hence on disregarding the spin perturbation we obtain 
the same type of reduction into non-combining systems of 
terms as before, except that the multiplicity, which was previ- 
ously equal to the dimensionality g of x, is now given by (12.8), 
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(The spin perturbation causes weak inter-system combinations 
to take place and, in addition, resolves each term of the system 
X into its m(x) components. m(x) is the multiplicity of the 
multiplet structure. Term systems x f° r which m(x) = 0 do 
not appear at all.) 

Our reciprocity theorem enables us to determine the con- 
stants b. As mentioned before, 7 r is contained in 7r X rr as the 
sub-group of elements of the form s X s\ the algebra p = (7 7) 
appears in p X p as the totality of algebraic elements of the form 
£a(s)(s X s). The elements xl of the algebra p constitute an 
8 

irreducible invariant sub-space ; let the irreducible repre- 
sentation of 7 r which is induced in this sub-space by the regular 
representation be denoted by f) z and its character by X(s). The 
space of all elements of the form xl in p X p is then <^> in the 
notation of § 10 ; it is the substratum of the representation 
<f)i> of p X p. <f)i> contains the representation f)' X f) exactly 
b times ; the reciprocity theorem then tells us that the number 
of times the representation f)' X f) contains the representation 
f) t on restricting 77 X 77 to its sub-group 77 is also b. Now this 
restriction to 77 sends the representation ( 12 . 6 ) of 77 X 77 into 
the representation 

(. s , s) -> U'(s) X U(s) 

of 77. This means, however, that b{x ) x) w the number of times 
the representation f) z of 77 is contained in the representation f)' X f) 
of 77 (no longer with boldface multiplication sign !). Hence 
b is expressed by 

b(x', X) = ( 12 . 9 ) 

With this we have carried our solution of the problem of deter- 
mining the multiplicities m(x) as far as is possible in the general 
case. 

Consider in particular the special cases (1) complete symmetry, 
£ = [§ft/] ? and (2) complete anti-symmetry, S = {ffl } — the 
Pauli case. For the first X(s) == 1. With each irreducible 
representation x is associated the contragredient representation 
with character %{ s ) = xl^ 1 ) 5 the substratum of the first 
is generated by the idempotent element e the substratum of 
the latter is generated by e. Or we may describe this situation 
by saying that x and x are the characters of mutually contra- 
gredient representations. (Accidentally x( 5 " 1 ) == x( s ) f° r 
complete symmetric group 7 7 ; this does not hold for a general 
permutation group, however, whereas our entire theory does.) 
Equation (12.9) now becomes 

x) = nx(s)x(s^)}. 
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But in virtue of the orthogonality property of characters this 
mean value is 1 or 0 according as the representation £ is equiv- 
alent to x' or not. The expression (12.8) for the multiplicity 
then assumes the simple form 


m(x) = N r (x). 


The theorem that the representation f) x 1) contains the identical 
representation s 1 once or not at all according as {)' is equiv- 
alent to the contragredient of f) or not is nothing other than 
the fundamental theorem [III, (10.5)] on which the entire 
theory of representations was based. 

In the second (anti-symmetric) case A($) = 8 $ . Now 

X*{s) = S » • x( 5_1 ) 


is the character of the “ dual ” representation p* associated 
-with f) ; if f) is generated by the idempotent element e then f|* 
is generated by the idempotent e*(s) = 8, • e(i -1 ). Or if 

f ):s-*-U(s) then tj* : 5 -*■ 8„ • U(s). The expression for the 
multiplicity is in this case 


mix) = W) 


( 12 . 10 ) 


If we denote the 1-dimensional representation .s -> 8, by {1}, 
the fundamental theorem mentioned above tells us immediately 
that f)' x f) contains the representation {1} once or not at all 
according as f)' is equivalent to f)* or not. (12.10) is the actual 
multiplet formula, for this second case is the one which is of 
interest for atomic physics. 


Additional Remarks. 

The only cases of importance for physics, (1) that of sym- 
metric and (2) that of anti-symmetric double tensors, can be 
handled by elementary methods. We again refrain as long as 
possible from making restrictive assumptions concerning the 
field over which the algebras are defined. The method will be 
illustrated by application to case (1). 

(12.11) If e x , e t are equivalent idempotent elements, then 
S x , are also. 

Proof. Let p x be mapped on p 2 by a one-to-one similarity 
correspondence r: x 2 = xfb ; b is here the element, of char- 
acter (e b e s ), into which e\ is sent by T. Let the inverse corres- 
pondence carry e t over into a, which is then of character [e t , e x ). 
r carries a over into e s ; since the element associated with a by 
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r is ab we have e % = ab. Similarly, we find with the aid of JT 1 ” 1 
that e x = ba . We then have 

^2 — ab ) - ba , e 2 ae x — — e-ybe 2 — &. 

Conversely, the existence of these equations guarantees that 
#2 — #1 — #2# 

are reciprocal similarity correspondences ^ ^tp 2 . That is, the 
existence of these four equations means that e x and e 2 are 
equivalent. We need only to “ roof ” these equations in order 
to conclude that e x and e 2 are then also equivalent — i.e., go 
over to the quantities x associated with each of these x by the 
definition x(s) = x(s~ l ). We have here neither assumed that 
the e are primitive nor that the field is algebraically closed. 

(12.12). The invariant sub-spaces $, p generated by e . e are 
the substrata of mutually contragredient representations. 

Proof. Let p consist of all elements xe ; we introduce in 
addition to this left-invariant sub-space the right- invariant 
sub-space q consisting of all elements of the form ex. Let 
tr (xy) be the trace of the elements x and y, which may vary 
freely in q, respectively ; we assert that it is a non- degenerate 
bilinear form. That is : if tr (ay) = 0 identically in q then the 
element a of p must be 0, and if tr (xb) = 0 identically in p the 
element b of q must be 0. Indeed, if z is any arbitrary element 
whatever and a is in p } then 

az = ae • 2 = a • ez = ay, 

where y = ez is in q. Hence the assumption that tr (ay) = 0 in q 
implies that tr (ae) = 0 for arbitrary 2 , whence a = 0 [cf. § 4]. 
Similarly for the remaining case tr [xb) = 0. 

Now let b and q be referred to arbitrary co-ordinate systems 
and let the co-ordinates of x t y be £ 2 , * * *, *? 2 , • * *, Vn 

respectively. Then tr (xy) is of the form 

tr (xy) = SSikHiVi* 

The theorem above shows that g ^ h and h ^ g, whence h ==' g, 
and that the coefficients may be considered as the coefficients 
of a non-singular linear transformation. Hence on choosing 
the co-ordinate system in q in an appropriate manner tr (xy) 
may be reduced to the canonical form 
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But then 

tr (xy) = tr {yx) = tr (yr -1 • rx). 

Hence the simultaneous substitution 

x' = rx ) y' = yr~\ 

which does not lead out of p, q respectively, leaves the trace 
invariant. These two transformations are therefore contra- 
gredient in the new co-ordinate systems ; our assertion (12.12) 
then follows immediately on writing the second of these equations 
in the “ roofed ” form y f = r$ and noting that $ runs through 

the left-invariant sub-space p generated by i as y runs through q. 

After this preliminary skirmish we apply the method em- 
ployed before, somewhat modified, to the case (1) in which 

l = x *)• 

We are now interested in the reduction (12.4) only for symmetric 
elements x , i.e. elements which satisfy the equations 

x(ar ) sr) — x(a , s) (12.13) 

for all r. This amounts to replacing x by xl ; we subsequently 
note that xl(e' X e) is not symmetric and accordingly multiply 
again on the right by L We thus replace e' X e by l(e' X e)l 
rather than [e f X e)l and proceed to obtain an explicit expression 
for the reduction, rather than calling on the aid of the reciprocity 
theorem. First, the components of l(e' X e) are (on ignoring 
the factor 1//!) given by 

£e'(rar)e(rs) = 2JS(s^ l r mml )e / (rcr) = $e'($" l o). 

r r 

This expression vanishes if ie' = 0 ; for e' = i we find it is 
equal to e(s~ l cr) = e(a" 1 s). This suggests that we choose 

1 = 2X, 1 = Ze t 

i i 

as the two complete reductions (12.3) of the modulus 1. The 
only terms in the sum (12.4) which then remain for symmetric 
x — xl are those of the form x(S { x e,), and the factor 1(4 1 x e<) 
is the element with components e { (a~ l s). Since x{e t x e<) has 
not been reduced identically to 0 on restricting x to the domain 
of symmetric elements, the sub-space which it generates is 

here, as before, equivalent to the irreducible jif x {)<. The 
next step consists in multiplying on the right with l , whereby 
e(ar~ 1 s) becomes, in accordance with (8.3) and (7.22), 

^e(r-^sr) = ± x (s~' a) = - g • e(«r»j). 
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Our final result is that any symmetric x can be reduced in ac- 
cordance with 


% = xe' + xs' 1 


where e(or, s) = — s(o- _1 s) ; (12.14) 


in deriving this result it is to be remembered that the number 
of times any irreducible representation appears in the regular 
one is given by its dimensionality. 

It follows from the fact that s(s) is a class function that these 
elements s', e", * * * constitute a set of independent idempotent 
elements in pXp. This result is in fact obtainable by direct 
methods and is valid, regardless of whether the field in which 
we are operating is algebraically closed or not. To show this 
we note that any “ symmetric ” element x(o y s) is a function 
only of , 9 cr“ 1 in virtue of (12.13) : x(a y s) — x(sa~~ 1 ), Thus there 
exists a one-to-one correspondence between the symmetric 
elements of p X p — the space of which we denote by [x X x] — 
and the elements of X. Direct computation shows that this 


correspondence associates with each left-invariant sub-space of 
[x X t] a left- and right-invariant sub-space of t, and conversely; 
the reduction of [x X x] into left-invariant sub-spaces thus 
parallels the reduction of X into sub-spaces which are both left- 
and right-invariant. The whole problem is thus much simpler 
for [x X x] than for X itself ; its solution is obtained by carrying 
over the equation 

% = xe' + xs + • • * (7.5) 


for the algebra p to [t X t], the result of which is (12.14). 
Nevertheless we must return to the previous less elementary 
analysis in order to see— and this result presupposes that the 
field is algebraically closed— that each of the irreducible in- 
variant sub-spaces of [t X t] obtained in this way m equivalent 
to a sub-space of the algebra t X t of the form p Xp (where 
p and p are irreducible invariant sub-spaces of t with generating 

units^and^ieteiy anti . symmetr i c case can be dealt with m a 

corresponding elementary way. t-encors in the 

The complete reduction of the manifold of tensors in the 

2-dimcnsional* spit, space fc, v = 2 is t .ccomphshed w ,th the 
aid of the Clebsch-Gordan formula [III, (5.9)]. (c y 1S J-u x W X 
. . x V If factors) .where ®, is the representation of the linear 
by itself, and by the formula mentioned above this 
representation ^is completely reducible into thejrreduableR,, 

lSoS”S y n+ e r possible 
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dimensionalities there corresponds here but one irreducible 
representation. Formula (12.10) then tells us that there exists 
only one term system having the multiplicity v + 1(= / + 1, 
/ — 1, /— 3, • • •) ; compare the beginning of § 15 on this point. 

The preceding analysis seems to me to be necessary in order 
to obtain a complete understanding of the relations implied by 
the permutation group without recourse to the approximation 
characteristic of the theory of perturbations. So far as the 
latter is concerned we proceed as follows. Again consider a 
term of the form (8.-4) of the unperturbed system, the only 
degeneracy of which is that necessitated by the equality of 
the / electrons. The perturbation equation is then 

P{ L iHi * ’ ’» l Af) ^ *) ' P{h^i i * “ *» ifkf ) , (12.15) 

where the a(s) are the exchange energies and i x • • • i f) k x • • • k f 
are obtained from 1 • • • / by the permutations s y t respectively. 
Let (f> be the tensor in spin space defined by 

^(tj, ta 2, • • •, iff) = • • • if ) ; 

the anti-symmetry of the double tensor F then tells us that 

F{hH, • • *. */*/) = ' ' • <-/), 

and on letting a'(.y) = 8 S • a(s), (12.15) becomes 

<f> = (12.16) 

The problem is thus reduced to that of finding the characteristic 
numbers of this linear correspondence in the 2f-dimensional 
space 

Let be the characteristic functions of the single electron. 
If the perturbation is due solely to the Coulomb forces between 
the various electrons, that part of the energy matrix a(i x • • • i f ; 
k x • * • k f ) which is due to the perturbation is obtained additively 
from terms of the form 



h(P>) • ■ • h f (Pf) * MPi) • • • H p f) 

P«Pp 


dV x • • • dV f 


where a =4= j8 and the denominator is the distance between the 
two points P * and Pp. The orthogonality of the if/ tells us that 
this integral can be non-vanishing only if the permutation s } 
which sends the set of indices k into the set i (both of which 
are permutations of 1, 2, • • •, /), is either the identity or the 
transposition (ocj8). In this latter case we find 


a(s) = b* _ |j 
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On the right-hand side of (12.16) we then have only the terms 
arising from s — I and the transpositions s = (a/J) : 

4 = Ml) - ZE a j,(*p)}<l>. (12.17) 

Dirac has given a remarkable formula for the transposition 
acting on a spin tensor. Let be the spin of the a th electron : 
S%, S“, S“ are then the operators 


1 

! 

- <s> 

1 

O 


1 O' 

0 

1 

i 0| 

J 

0 -1 


acting on the a th index of the tensor <£(m 2 ■ • ■ u). On calculating 
in particular 8 

(&&) = sjsj + sisi + sjs* 

(which should perhaps be written (& x © 4 ) instead, since 6 1 
affects only the first index and & only the second), we find that 
it is the operator 


L l *2 


0 0 

1 


1 0 

- 1 

2 

0 1 

2 

- 1 

1 1 


1 


acting on the first two indices, all other places being 0. Hence 
~{1 + &&)} is the substitution 

<£(00) -> <£(00), <£(11) 9l(ll) ; <£(10) -> <£(01), <£(01) ,£(10) 

or the transposition of the first two indices. The energy (12.17) 
may then be written in the form 

H = E 0 - l ZE a p(&&). (12.18) 

"*<0 

This may be interpreted as saying that the coupling between 

the electrons * and p is responsible for the term — 

in the energy operator. However, the constant E 0 does not 
represent the energy of the unperturbed system. 15 
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C. Explicit Algebraic Construction 

§ 13. Young’s Symmetry Operators 

We now supplement the general theory developed above 
by an explicit algebraic construction of the irreducible repre- 
sentations of the symmetric permutation group tt = 7 r f . This 
problem is, as we know, equivalent to that of constructing the 
primitive symmetry classes of tensors of order / by means of 
idempotent symmetry operators e; here a “primitive” sym- 
metry class is one such that the symmetry of the tensors be- 
longing to it cannot be further increased by the addition of 
further symmetry conditions — such an additional condition 
either reproduces all the tensors of the class or reduces them all 
to 0. This construction is due to A. Young and G . Frobenius 16 ; 
with its help we are able to verify step by step the entire theory 
of representations of the symmetry group in an explicit and 
elementary manner. 

We are already acquainted with two very simple processes 
which yield tensors of maximum symmetry : “ symmetrization,” 
by means of which the tensor F yields the completely symmetric 
tensor £sF , and “ alternation,” which sends F into • sF . 

® g 

The first of these processes can be readily generalized as follows : 
We divide the range from 1 to n of the u variables ” i x i 2 • • • i f} 
on which the general tensor component F(i x i 2 • • • i f ) depends 
(or, what amounts to the same, the sub-indices 1, 2, • • *, /), 

into sub-sets of lengths /1 ,/.,•••; A + / 2 H = /. We then 

symmetrize with respect to the indices of each of these sub-sets. 


n 

n 

r 

n 

r 1 

1 




1 

1 





r 





L 



Pattern 7, 5, 4, 4, 1. 

This distribution into sub-sets may be readily visualized with 
the aid of a u pattern ” P = P(f x , / 2 , • • •) as illustrated in the 
accompanying figure [for the pattern P( 7 , * 5 , 4 , 4 , 1)] ; each of 
the / squares in the pattern is occupied by a different one of the 
7 integers 1, 2, ^ • •, /. Each of the sub-sets mentioned above 
constitutes a horizontal row of the pattern, and the various rows 
are arranged one under another. The individual sub-sets may 
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be arranged in order of decreasing length : f x ^ / 2 ^ • • • ; the 
pattern then consists of non-interrupted vertical columns as 
well as non-interrupted horizontal rows. Those permutations 
p which permute the members of each row among themselves 
constitute a sub-group (p) of 7 r of order /, !/ 2 ! • • • [denoted in § 8 
by ir(fi, ft, * * *)]* The symmetry operator described above, and 
which is to be applied to an arbitrary tensor, is 

* = Ip ; 

V 

henceforth p will always denote an arbitrary permutation which 
sends no numeral of one row into another row. 

So far we have made no use of the process of alternation. 
If after having symmetrized with the aid of the operator a we 
alternate with respect to certain of the variables or sub-indices 
1, 2, • • •, /, we certainly obtain 0 if any two of these numerals 
are in the same row, for the tensor obtained by the symmetriza- 
tion is symmetric with respect to any two such numerals and 
the result of subsequently alternating with respect to them must 
be 0. To avoid this situation we choose one variable in each of 
the rows and alternate with respect to them ; since the order 
of the variables in each row is so far immaterial we may place 
these chosen variables in the first column. We then disregard 
the first column and proceed to alternate with respect to a set of 
variables obtained by selecting one from each row of the re- 
mainder of the pattern ; these variables may now be shifted into 
the second column. This process is continued until we have 
covered the entire pattern ; the result is that we have symmetrized 
with respect to the rows and have followed this symmetrization by 
alternation with respect to the columns . Let q denote an arbitrary 
permutation which permutes the variables in each column among 
themselves ; these q constitute a certain sub-group (q) of 7 r. 
The alternation described above consists in applying the sym- 
metry operator 

b = ZSa • q, 

Q 

and the entire process consists in applying the resultant operator 

c = ha = • qp. 

Pi Q 

We call c the Young symmetry operator belonging to the 
pattern P. 

In order to obtain a unique symmetry operator c associated 
with a given pattern P we must specify the way in which the 
numerals from 1 to n are to be distributed in P : they shall be 
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introduced in such a way that on reading the pattern, as one 
would read a page of a book, they appear in their natural order 
1, 2 If we write them in any other order, say that ob- 
tained from the standard form with the aid of the permutation r, 
we obtain a “ conjugate ” element c r which, as is readily seen 
on considering the relation between the tensors generated by 
these two operators, is related to c by 

c T r = re or c r (s ) = c(r~ x sr). 

Hence the introduction of r results merely in a new name. 

From now on we operate with symmetry quantities, i.e. 
elements of the algebra (n), instead of tensors ; we consider the 
invariant sub-space p c of r consisting of all elements of the form 
y = xc and the representation Ij c of rr induced in it by the regular 
representation. With p c is associated the symmetry class 
of all tensors of the form cF. If we replace c by one of its con- 
jugates c T we obtain instead of p c an equivalent invariant sub- 
space ; in this sense the order in which the variables are written 
in the pattern is quite immaterial. We hope that p 0 is irre- 
ducible and that the totality of representations f) c associated 
with all possible patterns constitutes a complete set of inequi- 
valent irreducible representations of 7 r. This hope is strengthened 
by the fact that the total number of patterns is just equal 
to the number of inequivalent irreducible representations. To 
show this we note that the number of patterns is equal to the 
number of partitions of / into integral non-negative summands 
/ = A + /i+ ■ “ * which satisfy the condition f x ^/ 2 * * *• 

On writing 

fl r li fz /a == *2> ’ * * 

we see that this number is equal to the number of solutions of 
the equation 

l*i + 2r 2 + 3r s + • • • = / 

for non-negative integral r. But we have already seen that this 
is the number of classes of conjugate elements in i t and, by the 
general theory, is therefore equal to the number of inequivalent 
irreducible representations of tt. 

If the dimensionality n of the vector space is less than / 
the only non-vanishing symmetry classes are those arising from 
patterns containing at most n rows, for if the first column is 
longer than n alternation with respect to the variables standing 
in it alone causes an arbitrary tensor to go over into 0. The 
only patterns which we need in this case are consequently those 
obtainable from the algebra r 0 , instead of r, where x 0 = as 
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defined in § 2 above. The number of inequivalent irreducible 
invariant sub-spaces into which the tensor space 91/ can be 
reduced is accordingly decreased to the number of partitions 
of / into n integral summands / = f x + / s + • * * + f n for which 

f\ S /2 ^ * S fn S 0. 

A permutation s = qp which is obtained by composition 
from a permutation p of ( p ) and a permutation q of (q) can be 
so obtained in only one way. This is an immediate consequence 
of the remark that the equation qp = I can be fulfilled only by 
p = I, q — I, for it asserts that p = q~ x belongs to (p) as well 
as to (q). The components of the symmetry operator c can 
therefore be described as follows : c(s) = 0 tcnless s belongs to 
the set (q){p) / when s belongs to this set c(s) = ± 1 according 
as the unique decomposition s = qp yields an even or an odd 
permutation q. 

We must now prove the following three assertions con- 
cerning c : 

(1) c is essentially idempotent ; or, more precisely, c satisfies 
an equation cc = y • c } where y is a non-vanishing numerical 
factor. Furthermore, y is an integral positive number which 
is a factor of /!. Then replacing e by e = c/y , e is idempotent. 

(2) The sub-space p c is irreducible, the e introduced in (1) is 
primitive. 

(3) Different patterns lead to inequivalent sub-spaces p c . 

The execution of this programme depends upon a simple 

combinatorial auxiliary theorem, which we now proceed to 
develop. Denote the lengths of the columns in the pattern 
P with rows of lengths f h f 2 , ■ • • by/*, /*, • ■ • : 

fiZft*- • -,/r 

/i +/» + •••= A* + ft + ■ ■ ■ = /• 

We think of the pattern P as cut out of a rectangular chess- 
board consisting of / x horizontal rows and f* vertical columns, 
and the permutation 5 as operating on / chess-men occupying 
the / fields. On interchanging rows and columns in P we obtain 
the dual or transposed pattern P*. 

Auxiliary Theorem . A permutation s belongs to (qp) if and 
only if any two pieces originally in the same row are not sent into 
the same column by s . 

Proof . It is evident that this condition is necessary in 
order that $ belong to (qp). The change of position which one 
of the pieces suffers as a result of s can be accomplished in two 
moves, a horizontal and a vertical move (in this order). It 
is at first conceivable that the horizontal move could send the 
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piece into a field of the original board which is not contained in 
the pattern P. If the decomposition s — qp is possible p must 
represent the horizontal move and q the subsequent vertical 
one ; it is clear that q and p are thus uniquely determined. 
Now if s satisfies the conditions enunciated in the above theorem 
the horizontal move can never throw them into the same column, 
i.e. the same field. It only remains to show that the horizontal 
move can never send any piece out of the pattern proper, or : 
those pieces which $ sends into a column of length f* come from the 
first f* rows of the pattern . We divide the chess-board horizontally 
into an upper and a lower part, the upper consisting of the 
first /*’ rows. The pieces which are sent into the first column 
by s are, by assumption, from ff different rows ; hence there 
are at least (and therefore exactly) /* — /* of them which come 
frbm the lower part of the board and not from the first /* rows. 
Note that f* — - f* is exactly the number of fields in the first 
column which lie in the lower part of the board. On applying 
this argument to each column in succession we find that the 
number of pieces which s sends into those columns which pro- 
trude into the lower part of the board is exactly equal to the 
number of fields in this part of the board. Hence all the pieces 
in the lower part of the pattern are sent into columns whose 
lengths are greater than /*, and the only pieces s sends into a 
column of length/* come from the upper part of the board. 

This auxiliary theorem allows us to assert that if 5 does not 
belong to ( qp ) then there exist two pieces in a single row which 
are sent into the same column by s. If u denotes the trans- 
position of the two pieces in their initial positions and v their 
transposition in the final then su = vs ; here u belongs to ( p ) 
and v to (q). 

§ 14. Irreducibility, Linear Independence, Inequival- 
ence, and Completeness 

We now examine the Young symmetry operators c associated 
with the various patterns. Obviously 

c{sp) = c{s), c[qs) = 8, • c(s), (14.1) 

where p, q are, as usual, elements of (p), (q), respectively. 17 

Theorem (14.2). Any element a of (tt) which satisfies equations 
(14.1) . 

a(sp) = a(s), a(qs) = 8 a . a(s), (14.3) 

is a multiple of c. 
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To prove this theorem we first note that (14.3) implies 
a[qp) = S a • «(l) ; 
on setting #(l) = A the equation 

a(s) = A • c(s), 

which is to be proved, is certainly correct for all group elements 
s of the form qp. We must next show that a(s) = 0 if ^ does 
not belong to the set (qp). Such an s implies that there exist 
transpositions u and v, lying in (p) and (q) respectively, for 
which su = vs. But then by (14.3) 

a(su) = a(s) } a(vs) — • a(s) = — a(s), 

whence a(s) = — a(s) or a(s) = 0. 

Theorem (14.4). Every element of (n) of the form cxc is a 
multiple of c . 

It was shown in the general theory that this theorem is 
valid if c is a primitive idempotent element of (tt) and if 
the field in which \ve operate is algebraically closed ; here we 
approach it from the opposite direction, as we wish to show 
directly that it holds for c in order to prove that c is primitive. 
Now obviously any element of the form xc satisfies the first of 
equations (14,3) and any element cx the second ; hence any 
element of the form cxc has both properties and is consequently 
a multiple of c. 

Theorem (14.5). cc = yc and y is a positive integer which 
is contained in /!. 

That cc is a multiple of c follows immediately from the 
previous theorem ; y is therefore the number 

y = Zc(t)c{t') = Zc(s) • c(s-'). 
tv ^ I * 

Let the sub-space p c of elements of the form xc be of dimension- 
ality g . The projection 

x -> y = xc (14.6) 

projects any element x into an element lying in this sub-space 
and is, within itself, merely the multiplication y = yx. Its 
trace is therefore yg ; to see this we need merely to adapt the 
co-ordinate system in group space to the sub-space p c . On 
the other hand its trace is immediately obtainable from (14.6) or 

y(s) — Zx(t)c(s~H) ; 

t 

it is f\c( I) = /!, hence 


y£ = /l- 
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Consider the meaning of this fact that y is positive, i.e. that 
c(s)c(s -1 ) is oftener positive than negative ! 

e == cjy is idempotent ; hence the character of the repre- 
sentation 1) e induced in p 0 by the regular representation is 
by (8.3) 

X(s) — p Z^r-'sr). (14.7) 

We obtain as a by-product the fact that the dimensionality g 
of the representation f) c is a factor of /!. 

Theorem (14.8). is irreducible. 

We know already that this theorem is a consequence of (14.4), 
but it may be instructive to prove it directly as follows. Let 
e = cjy be reduced into two independent idempotent elements 
e x + e z ; then 

ee x = e x e = e lf whence ee x e = e x . 

Now by theorem (14.4) any element of the form ee x e is a multiple 
of e ; hence e x = A*. e x e x = e x then yields the equation A* = A 
for the number A. Consequently either A = 1 or A = 0, i.e. 
either e x = e or e x = 0. 

We shall say that the pattern P f with rows of lengths 
fii f'% • • • is higher than P if the first non-vanishing difference 
fi — A, A— A,"' is positive. 

Theorem (14.9). If the pattern P’ is higher than P then 
c’c = 0. 

We do not here assume that the variables are written in 
the patterns P, P' in the normal form agreed upon in the previous 
section — i.e. in which the numerals appear in their natural 
order on reading the pattern as one would a page of a book. 
The proof is based on the fact (F) that there exist two numerals 
which are in the same row in the pattern P' and in the same 
column in the pattern P. If v is their transposition it belongs 
to the group (p ') associated with the rows of P' and at the same 
time to the group ( q ) associated with the columns of P ; hence 

c'{sv) = c'[s), c(vs ) = — c(s). 

On replacing vt in 

c'c{s) = Zc'{sr*)c(t) = - Zc'(st-')c(vt) (14.10) 
by t alone we find 

c'c(j) = — Zc'(st~h>)c(t) = — Z^(sr l )c(t) — — c'c(s). (14.11) 
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(F) is evident if the first row of P' is already longer than the 
first row of P, for it is impossible to distribute the // numerals 
in the first row of P' over different columns of P if f x </'. 
If fi = fx and the numerals of the first row of P' are actually 
distributed over different columns of P, we discard the first 
row of P' and the f x fields of P containing the same numerals as 
this row. On shifting the fields of P upward to fill in the gaps 
P is transformed into a pattern which has exactly the same 
appearance as if we discarded the first row oi S ; we are only 
interested in the fact that this process leaves all pieces in their 
original column. The proof can then be completed by mathe- 
matical induction — by assuming that it holds for the abbreviated 
patterns obtained by omitting the first rows of P and P'. 

Theorem (14.12). Let c ) c\ • * • be the Young symmetry 
operators associated with different patterns P, P r . • • • ; the corre- 
sponding sub-spaces p c , • • • are then linearly independent. 

Let the P, P', P", • •’ • be arranged in such an order that 
P is higher than P', P' higher than P", • • \ An element x of 
p = p c is reproduced by right-multiplication with cjy but, by 
the previous theorem, this process transforms all elements 
x f of p\ x" of p r \ • * • into 0. Assume there exists such a linear 
dependence 

# + *' + *" + • • • = 0 ; 

on right-multiplication with c we find x = 0 and consequently 
%’ 4 * x" + • • * = 0. The theorem is thus reduced to the 
same theorem for the smaller set P', P", • • *, and the proof 
follows by mathematical induction. 

Theorem (14.13). Different patterns P, P' give rise to in- 
equivalent sub-spaces ^ c , p cf . 

The proof is accomplished by a direct derivation of the 
orthogonality relations. Let P' be higher than P. Since we 
did not assume in proving theorem (14.9) that the numerals 
were distributed in the same order in the two patterns P and P', 
we may replace the element c with components r(s) by the 
“ conjugate ” element c r ~i with components c(rsr~ x ) : 

2jC f (st~ l )c(rtr~ x ) = 0. 

t 

Summation with respect to r yields 

ZC(sr') ■ X c{t) = o. 

t 

On writing % = Xc, X ^ Xc f ^his formula is equivalent to 

= o. 
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In particular 

ZY^xW = o. 

t 

If the two sub-spaces were equivalent we would have x'W = 
and since x^” 1 ) = xM f° r the symmetric group the above 
equation would yield 

Zx 2 (s) = 0 . 

8 

But this is impossible, for by (14.7) the character x(^) has 
rational components, and in particular x(l) = g =f= 0. 

This last conclusion is valid only if the number field in which 
we operate is non-mo dular ; naturally this restriction is irrelevant 
for physics. Nevertheless it constitutes a blemish which should 
be removed, for the remainder of our deductions only introduce 
the minimum assumption that /! is not 0 in the field under 
consideration. Now from the general theory we know that 

Theorem (14.14). Zx( s )x[ s ~ x ) =/!• 

* 

The blemish mentioned above is removed by proving this 
theorem directly. We must show that 

Zx( 5_1 ) ‘ e(s) — 1 

8 

or 

Zeirs-'r-Ws) = 1 . 

r,8 

On replacing the summation variable s by sr , where r is fixed, 
this becomes 

Ze(sr)e(s~ 1 r-'-) = 1. (14-15) 

r,8 

Consider next the function 

a(s, s') = 2Je(sr)e[s'r- 1 ) ; 

f 

as a function of 5 it satisfies the second condition in (14.3). 
But the first of these conditions is also satisfied, as can be seen 
immediately by replacing r in 

a{sp, s') = £e(spr)e(s'r~ v ) 

r 

by the summation variable Hence by (14.2) 

a[s, /) = c(s) • 2Je(r)e(s , r~ 1 ) = c(s) • e(s') = -c(s)c(s') 

r y 
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and therefore the left-hand side of (1415) or 

£a{s, s- 1 ) — - Zc(s)c(s-i) 

* y * 

is actually equal to 1. 

The relations 

27x(s)x'(-s _1 ) = 0 or/! (14.16) 

S 

show that the primitive characters obtained by our construction 
from the various symmetry patterns are linearly independent, 
and since their number is equal to the number of classes of 
conjugates in the group tt, any class function can be represented 
as a linear combination of the x(s). In particular, the function 
l(^), which is 1 for 5 = | and otherwise 0, must possess such 
an expansion : 

/! * 1 (s) - m x (s) + m'x'(s) + • • (14.17) 

Multiplying by xO^ 1 ) an< ^ summing over s we obtain, with the 
aid of the orthogonality relations (14.16), the equation 

/! x(D =f\m 

or 

(14.18) 

for m* Since 

x (s) = Ze(rsr~ l ) = 2> r (s), 

r r 

equation (14.17) gives the reduction of the modulus 1 into 
primitive idempotent elements e T . Hence the regular repre- 
sentation is reduced into the irreducible representations f) c 
associated with the various symmetry patterns. Since f\l{s) 
is the character of the regular representation, eq. (14.18) is a 
direct verification of the fact — proved in the general theory that 
the number of times each irreducible representation appears 
in the regular representation is equal to its dimensionality. 
This completes our direct and elementary development of the 
theory of the representations of the symmetric group. 

The method of proof employed in establishing theorem (14.9), 
i.e. that cc’ — 0 if P' is lower than P, will now be used to answer 
another question. Let a be the operator, introduced in the 
previous section, which symmetrizes with respect to the ciphers 
occupying the rows of P : 

a{s) = 1 Or 0 according as s belongs to (p) or not, 
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and let the numerals be written in the pattern P', which is 
lower than P, in an arbitrary order. I assert that ac’ — 0. 
There exist two numerals which occupy the same row in P 
and the same column in P'. If v is the transposition of these 
two numerals then 

a(sv) = a(s), c'(vs) = — c’(s), 

and the assertion is proved with the aid of (14.10), (14.11) on 
replacing c\ c there by a, c\ Hence also 

ffa(st~ l )c'(rtr~ l ) = 0, 

t 

Za(sr l )x'(t) = 0 or Za(r~ l )x(rs) — 0. 

t r 

That is, the sum of the x'(t) extended over all elements t = rs 
which are left-equivalent to s mod. (p) [i.e. r in (p)], is zero. 

In particular, £x'( s ) — 0> where the sum is extended over all 
# 

elements s of (p) ; x is the character associated with a pattern 
P' which is lower than P. On applying this result to the con- 
siderations of § 8 (in particular, to (8-13) ff.) we find : 

If the individual I has the simple energy levels E t , Ii t , the 

term 

fi^i + A^a + • • • (A ^ A +/* + ' * * “/) 

of the unperturbed system V appears only in those symmetry 
classes of tensors whose pattern P' — P[ff, ff • * •) is not lower 
than P = P(AA • ; •)• 

Thus we saw in discussing the two-electron problem that 
terms of the form E x + E t appeared in the “ anti-symmetric ” 
as well as the “ symmetric ” term systems, whereas terms such 
as 2 Ex appeared only in the latter. 

Finally, we consider the relations existing between two 
dual patterns P and P* with generators c, c* and characters 
X, x*- The group (p) which permutes the members of each 
row of P among themselves coincides with the group (q m ) which 
permutes the members in each column of P* among themselves ; 
similarly ( q ) = (p*). U s = qp is in (qp), then s -1 = p~ l q~ l — 
q*p* is in {q*p*), and conversely ; for such an element 

c[s) = K, c *(s 1 ) = = Sp. 

Hence in general — even when $ is not in (qp) and, consequently, 
is not in (q*p*) — we have 

c*(s~') = 8, • c(s). 
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“ Dual elements c , c* are therefore related to each other in 
exactly the same way as the “ duals ” introduced in § 12. 
Further 

y* = y ; x* (^ _1 ) = x*( s ) = • xM ; f = g* 

If P is higher than then conversely P* is lower than Q*. 
For if we lower P by taking away the last field of one of the 
rows of P and adding it to the end of a later (shorter) row, one 
of the columns of P is increased at the expense of a later (shorter) 
column ; by such a process of shifting individual fields, in which 
no gap is to occur in a row or a column, P can be transformed 
into the lower pattern Q. 


§ 15* Spin and Valence* Group-theoretic Classification 
of Atomic Spectra 

If the vector space 9ft = is only 2-dimensional, the only 
symmetry patterns P which give rise to primitive symmetry 
classes of tensors of order / are those which consist of at most 
two rows. Let the first row contain l + v fields and the second 
l ; then 

v=f-2L 


The symmetry pattern P is thus uniquely characterized by the 
number v , which we call its valence, and v may assume any of 
the values /, / — 2, / — 4, • • \ Let % be the totality of tensors 
of the form cF obtained by applying the Young symmetry 
operator c associated with the pattern P to the totality of tensors 
F, and let be the representation of the linear group, the 
substratum of which is the tensor manifold $ v . A sufficiently 
general tensor of order / which is symmetric in the first as well 
as the second rows of indices is given by 


E X J X ' 1 1 X J X J {l + v terms) 

X t) X t) X • • • x t) (l terms), 

where % . x 

5 = (#i, %2)) ty (Xn V 2 ) 

are two arbitrary vectors. On alternating with respect to the 
columns we find that the representation of the linear group 
c = c 2 is that one which is induced on the quantities 

(x& t — x&x ) l ' x i + r 2 = v). 


Hence £>» is the representation of the linear group which was 
denoted in III, § 5, t>y ©«• 
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This remark supplies the connection with the symmetry 
problem of quantum mechanics as dealt with in § 12 — on apply- 
ing the Pauli exclusion principle when the existence of the spin, 
but not its dynamical effect, is taken into account. 18 Since 
the spin space is 2-dimensional, formula (12.10) tells iis that 
the only patterns P which give rise to a term system are those 
whose duals P* consist of at most two rows, i.e. those P which 
themselves have but two columns. If v is now the number of 
fields by which the first column of P exceeds the second we call 
v the valence of the term system or of the corresponding state of 
the atom. The multiplicity of the term system with valence 
v i s v i } and to each of these possible multiplicities corre- 
sponds but one term system as we have already seen in § 12 
(in particular p. 356). We previously (Chap. IV) called s = v/2 
the “ spin quantum number.” 

The fact that the longest column of P cannot exceed the 
dimensionality N of the vector space % associated with the 
electron translation may result in a further restriction on the 
possible symmetry patterns P. This situation cannot arise 
as long as we deal with the total 00 -dimensional system space. 
On the other hand if we restrict ourselves, for example, to those 
states of the electron which are characterized by a fixed principal 
quantum number n and a fixed azimuthal quantum number l 
— and which therefore constitute a (21 + 1) -dimensional sub- 
space 3 fi(nZ) within 9ft* — i.e. if we consider only those states of 
the atom in which all the / electrons outside a closed core are 
in 9fl(wJ), the dimensionality N is reduced to 21 + 1. Then / 
cannot exceed 2(2/ -\- 1) and the possible valences of the states 
under consideration are given by the following table : 


/= 

1, 2, 

3, 

4, • • • 

•••, 4/, 4/+1, 

4/+ 2 


1 0 

1 

0 

• • • 0 1 

0 

V 

2 

3 

2 • • • 

. . . 2 





4 




This table again gives us the alternation law, but shows that in 
addition the number of possibilities decreases from the middle 
of the table on. The possible multiplet numbers 2s + 1 of 
terms in these states is one greater than v. 

This “valence” v y which describes the symmetry state of 
the system, is actually the chemical valence, as was shown by 
F. London . lfl We allow two atoms, consisting of f X) f % electrons 
respectively, to come together to form a molecule with / = / x + f% 
electrons. Let Sp 2 , be irreducible invariant sub-spaces of 
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the system spaces SR/ 1 , SR/ 2 respectively. In order to find which 
symmetry states the molecule is capable of assuming when the 
first atom is in the state ^ and the second in we must com- 
pletely reduce the space x into its irreducible constituents. 
If we consider this decomposition as taking place in the vector 
space of electron spin rather than in that of electron trans- 
lation (the justification for which will be given below), the 
problem is solved by the Clebsch-Gordan series (III, 5.9) ; it 
tells us that if the valences of the symmetry states of the two 
atoms are v u v 2 the resulting symmetry states of the molecule 
are those with valences 

v v x + v 2l v x + v 2 — 2, v x + v 2 — 4, • • •, \v x — v 2 \. (15.1) 

1 his situation can be readily visualized in terms of the symmetry 
patterns as follows. Bring the two symmetry patterns P 1} P 2 
of the two atoms into the positions shown in 
the accompanying diagram and then shove 2 
vertically upwards, one field at a time, until one 
of the two columns of the combined pattern is 
closed ; each of these steps represents a possible 
symmetry pattern for the molecule, in which v is 
the number of fields which are not paired hori- 
zontally. The saturation of the valence bonds 
here appears as the pairing of fields or, more physi- 
cally, as the saturation of the spin of an electron 
in one of the atoms with that of an electron in the 
other. The empirical theory of the valence bond 
has therefore a rather profound significance. 

We have yet to justify our use of spin space 
rather than translation space in the above. Let the representa- 
tion of the permutation group rr f corresponding to the two- 
columned symmetry pattern of valence v be denoted by f) v ; its 
dual f)* consists of but two rows. The Clebsch-Gordan series, 
together with the third reciprocity theorem of § 10 as applied to 
the linear group c = c 2 , tells us that on restricting tt to the sub- 
group 7r' = 771 X tt 2 which permutes the electrons of each atom 
separately the representation f)* of tt contains the irreducible 
representation f)* X f)J t of tt' once or not at all, according as 
v is one of the values (15.1) or not. From this it follows im- 
mediately that the same result holds for the duals on reducing 
f) 9 after restricting tt to tt'. Applying the same reciprocity 
theorem in the opposite direction for the case in which C — C n 
is the linear group in n dimensions, we find that the representa- 
tion Sq Vx X of C (or the algebra 2) contains the representation 
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once or not at all according as v is one of the values (15.1) or 
not. On reducing ^ X into its irreducible constituents 
we may expect to find other representations — which may even 
occur more than once — in addition to these simple but these 
additional representations will correspond to symmetry patterns 
with more than two columns and are, in virtue of the Pauli 
exclusion principle, of no importance for physics. The number 
b introduced in § 11 is accordingly at most equal to 1 in the case 
of diatomic molecules. 

Molecules which consist of a larger number of atoms can 
be studied by the same method. If in particular we are in- 
terested in the case of three atoms and their valences are v h v 2y v 2 , 
we can determine with the aid of the Clebsch-Gordan series 
the number b v of times the representation S* occurs in the 
reduction of £ Vl X £ Vs X ® rs . Those v for which b v 4= 0 are 
the valences of the possible symmetry states of the molecule 
and b = b v (which may here be greater than 1) are the corre- 
sponding multiplicities. The characterization of the quantum 
and symmetry states of a molecule which is formed by the 
union of three atoms in given quantum and symmetry states 
requires, in addition to the valence v ) a further index which 
distinguishes between the various b v possible energy levels. 
But this description of the various possibilities differs from the 
empirical theory of the valence bond — the manifold of possible 
bindings is smaller. 20 

Classification of Spectral Terms. 

Let the unitary or the complete linear group c yn in the system 
space 91 of the single electron be restricted to the group c,XC n 
of transformations S v X the two factors of which are trans- 
formations of the spin and translation spaces 9^, 9l n respectively : 
9t = 9t, X 9l n . The space {9?/} of anti- symmetric tensors of 
order / is then reducible into irreducible invariant sub-spaces 
with respect to the algebra of symmetric transformations of 
the form (12.2). We thus obtain a distribution (I) of spectral 
terms among the various symmetry classes ; this step is of 
universal validity and is applicable to molecules as well as 
atoms. 

The further classification of terms, as discussed in Chapter IV, 
A, refers to “simple” rather than “quantum” states, i.e. to 
those states which are related to spatial rotation and moment 
of momentum in the same way that the quantum states are 
related to displacement in time and energy. Naturally this 
application of the rotation group b = b 3 (the elements of which 
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we now denote by a, r, * * •) is significant only for atoms (or 
ions), the nuclei of which are considered as fixed centres of 
force. So long as we concern ourselves only with the electron 
translation and neglect the mutual perturbations of the electrons, 
which are characterized by principal and azimuthal quantum 
numbers n and l, each individual term of the system is char- 
acterized by the quantum numbers (n 1} ; n 2 , Z 2 ; • • • ; n u l f ). 

The number of times such a term appears in a given symmetry 
system is equal to the dimensionality of the linear sub-space 
in which the atomic states under consideration lie. The resolu- 
tion caused by the mutual perturbations parallels the reduction 
of this sub-space into its irreducible constituents Rl with respect 
to the group b of rotations ; the resulting components of the 
term have the natural multiplicities 2 L + 1. The spin space is 
similarly to be reduced. Let b induce the representations 
ij v : a U(ar) and © : a -> V(or) in 9ft v and $R n respectively. This 
second step (II), in which the spin and translation spaces are con- 
sidered separately, is interpreted from the stand-point of group 
theory as meaning that we associate with the element (a, r) 
of b X b the transformation U(o) X V(r ) ; we thus obtain a 
6-parameter sub-group of q, X c n , and on restricting c* X c n to 
this sub-group our original irreducible sub-space is further 
completely reducible into irreducible constituents. The irre- 
ducible representation of b X b induced in such a sub-space is 
of the type X The final step, (III), consists in introducing 
the coupling a = t : the 6-parameter sub-group is thereby 
restricted to a 3-parameter sub-group, i.e. that sub-group 
induced in the total system space by the rotations b. The 
spin perturbation then resolves each such term multiplet into 
its (at most 2s + 1) components : 

X ®i = (j = l + s, l + s — 1, • * •, | i — ^|) ; 
i 

naturally S) 5 X is here a representation of b instead of b X b. 

Actually v — 2, and the transformations induced in the 
spin space by the rotation group constitute the unitary group 
in two dimensions. Consequently the transition from c„ to f) v 
in step (II) involves no reduction in spin space — this is the 
essential simplification caused by the fact that 9L has so small 
a dimensionality. 

To the symmetry system of terms corresponds a certain 
irreducible representation of the unitary group u in the space 
SRt of the electron translation and with it a certain irreducible 
characteristic (§ 9) 

X = X(e lf e* * • •)■ 
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The co-ordinates x t in the space 31* are broken up into classes 
in the manner described in Chapter IV, § 1 : 

x[m) [m == l, l - 1, • • •, - l ] ; 

x'(rri) [«' = /' • V - 1, • • •, - l'] ; • 

Each of these classes describes a (2 1 + 1) -dimensional sub-space 
9R(wi) of JR* in which the group b 3 of spatial rotations induces 
the irreducible representation SB* and is characterized by the 
principal quantum number n and the azimuthal quantum number 
L The arguments e* of X are correspondingly broken up into 
classes- To give the principal and azimuthal quantum numbers 
of the individual electrons — without stating how these numbers 
are distributed among the / electrons — we need only to state 
how many (/') electrons are represented by states in each of 
the various sub-spaces SR' = SR(nZ). If, for example, 3 of the 
electrons are in SR' and the remaining 5 in SR" (/== 8) we must 
separate out that part of X which is of degree 3 in the variables 
Si belonging to SR' and of degree 5 in those belonging to SR". 
The multiplicity M of the corresponding term 

E[ n ih) + + * * * + Efo/lf) 

of the “unperturbed” atom in the symmetry . system under 
consideration is then obtained from the part of X described 
above by setting all e contained in it equal to unity. In order 
to determine how this Jkf-fold term is broken up on taking the 
mutual influence of the electrons into account we replace the 
variables s (m) of the class SR (nl) by e(w) = s w , the variables 
e'(ra') of the class SR(n'e') by s'(m') = s m/ (with the same e), etc. 
The resulting expression must be a linear combination of the 
sums 

4- L gl + 1 g — L 


with non-negative integral coefficients. This enables us to 
tell which of the various total azimuthal quantum numbers L 
appear, and how often, in the resolution of the above term ; 
each such L-term has still the multiplicity 2 L + 1. 

Example . We consider, as an example, the case in which 
/= 3 and all three electrons are in the same sub-space 9ft(wZ). 
The possible symmetry patterns are 
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The Pauli exclusion principle allows only the first two ; their 
valences are v = 3 and v = 1, and the corresponding terms are 
therefore quadruplets and doublets, respectively. The first 
pattern defines the anti-symmetric tensors of order 3 and the 
third the symmetric tensors. The corresponding characteristics 
are therefore 

X 1 = S 1 = Z SiEiSfc, x 3 = Z ' 

i <j <k i <; fc 

On introducing 

$ 2 ^ S 3 = 

i 4 

we have X 3 = s x + s 2 + 5 3 . The dimensionalities of the re- 
presentations of tt z corresponding to these three patterns, and 
therefore the numbers of times the representations X 1} X 2 , X 3 
of C appear in (c) 3 , are easily shown to be 1, 2, 1, (in accordance 
with the equation 3! = l 2 + 2 2 + l 2 ). Now the characteristic 
of the representation (c) 3 of c is 

h == ^ s z J (15.2) 

i 

the equation 

A = X x + 2X Z + X 3 = (2 s x + 5-2 + s 3 ) + 2X 2 
then allows us to conclude that 

X 2 == ^2 + 

We prefer to carry out the evaluation with the aid of the sums 
of powers 

k, k = Z<h • Z^i, k = Z<h ; 

i i i 

we then have 

^2 TT *^3 + ^ 2 j ^3 ^3 


in addition to (15.2). Consequently the characteristics in 
which we are interested are : 


Doublets : X 2 = ^(t x — t z ) 


Quadruplets : X x = ^ 


n 




(^2 ^ 3 ) 


(15.3) 

(15.4) 


The solution of the problem discussed above is now obtained 
by replacing the 21 ■+ 1 variables s £ by the set 


s l , s 1 "" 1 , • • 
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and then expressing t ly . t 2 , t z as a sum Ea L (L) of expressions of 
the form 

(L ) : e L + s 1-1 + • • • + 

with integral multiplicities a L . The computation is considerably 
simplified by multiplying both sides of the equation by s — 1, 
as (L) then becomes e Lfl — e“ z '. The multiplicities so obtained 
are given in the following tables : 

L= Zl, 3/ — 1, Zl- 2, — l | L= 0, 1, 2 


\ji\ 

Multiplicity : 

1 h 1 

Multiplicity : 


], 2, 3, • • ■ 

(increasing by i each step) 


L = 3/, 32-1, 3 1 -2,31- 3, 


1, 3, 5, • • 

(increasing by 2 
each step). 

■, 2 


,1 


Multiplicity . 


1, 0, 1, 0. 

(alternately 1 and o) 

l = 2, 1 - 1 , 2 — 2 , 1 - 3 , • • 

1, -1, i, -1, • 

(alternately 1 and — 1) 

L = 31, 31 -1,31- 2, 32 - 3, 3 1 — 4, 3/ — 5, 

1 , - 1 , 0 , 1 , - 1 , 0 , 

(repetition with period 3) 


On applying these results to the computation of X 2 , X x with the 
aid of (15.3) and (15.4) we find that the number of terms with 
total azimuthal quantum number L is as given in the following 
tables : 


Doublet System 


L= 0, 1, 2, 

3, 4, 5, 

• 

0 1 2 

2 3 4 

• 


(i) 


up to L — l. The period is here 3 ; the multiplicities in the 
second period are those of the first increased by 2, those in the 
third are obtained from those in the second by adding 2, etc. 

( 2 ) 


L = 31, 31-1,31-2 

| 31-3,31- 4, 32 - 5, 

. . . 

0 1 1 

i 1 2 2 



down to L = l. The periodicity is again 3, but the multiplicities 
in each period are obtained from those in the previous one by 
adding 1 instead of 2. J 

Quadruplet System. The periodicity is here 6 instead of 3. 

(1) For the values of 1. from 0 to / the first period of multi- 
Pj*: lties (-^ = °> 1 > 2 , 3, 4, 5) is for even l : 0 1 0 2 1 2 and for 
odd l : 10112 1. The multiplicities increase by 2 from period 
to period. 
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(2) For values of L from 3/ down to l the first period is 
0 0 0 1 0 1 regardless of whether L is odd or even, and the 
multiplicities are increased by 1 from period to period. 


§ 16. Determination of the Primitive Characters of 

tt and n 

The guiding principle in the whole of the present chapter 
is the reciprocity between the symmetric permutation group rr f 
and the algebra 2 of symmetric transformations. But this 
latter can, as was shown in § 1, be replaced by the special 
symmetric transformations induced in tensor space by the linear 
transformations of vector space and which constitute a group 
(c Y isomorphic with the linear group c. Indeed, we may even 
restrict c to the unitary group it. The algebra 2 is thereby 
referred to a group — not to a finite group, it is true, but to a 
closed continuous group. Now we have seen in Chapter III 
that we may expect such groups to behave in a manner entirely 
analogous to that met in dealing with finite groups, at least 
if we concern ourselves only with unitary representations. As 
a rule we find in mathematics that the continuum is more easily 
handled than a discrete manifold ; the formula (9.11), which 
expresses the fundamental reciprocity mentioned above, will 
therefore better serve to compute x from X than the converse. 

We therefore next evaluate the characteristics X of the 
continuous irreducible unitary representations of the n-dimen- 
sional unitary group it by a direct method which is independent 
of our previous development. The case n -- 1 has already been 
solved in III, § 8 ; the procedure there developed serves as 
a model for the present case. With this in mind we first prove 
the following auxiliary theorem : 

A continuous function f(co h oj 2 , * * •, io n ) of absolute value 
1 which possesses the period 277 in each of the n real arguments 
and which satisfies the functional equation 

/((« + «'))= /(H)/(K)) 

is necessarily of the form 

/((«)) = + h 2 co 2 + • * * + h n co n ) , 

where the constants h are integers. 

On introducing the n functions 

AH = /H o, o, • • o ),AH =/( o, <*>, o, • • •, o), • • • 
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of one variable, we are able to conclude from the functional 
equation above that 

/(<•>!, 0) 2 , • • •) = /l(ct>l )ft[(0 2 ) • • • 

It therefore suffices to prove the theorem for functions f(a>) of 
one variable, and this we have already done [III, § 8], 

Every element S of the group U is conjugate to a “ principal ” 
element E , i.e. to a transformation of the form 

x v e„ x v (y = 1, 2, • *, n). (16*1) 

The numbers e, are of unit modulus and may therefore be ex- 
pressed as 

e„ = e lt * v = e{oj v ) 

in terms of the “ angles of rotation ” c u 1} co 2) • • •, co n (which are 
only determined mod. 2tt) of the unitary transformation 5. 
In order to employ the orthogonality relations it is necessary 
to determine the volume dS of that portion of the group mani- 
fold U whose elements have angles between a> v and a> v + doo v . 
a i> a 2 , • * *, <&» being any n numbers, let D(a l9 a 2j • • *, a n ) denote 
the product 

n (a ( — tffc) = I a n -\ •••,«, 1 | 

i < k 

of differences ; the n rows of the determinant on the right are 
obtained by replacing a successively by a h a 2) • • •, a n . The 
evaluation of the volume element dS will be carried out in the 
following section ; we here anticipate the result 

dS — A A da> x da) 2 • • • dco ni A = D(s 1) e 2 , • • *, e n ). (16.2) 

The determination of the primitive characteristics of U is 
accomplished by combining the following important facts. 21 

1. Symmetry . — Each element S of u is conjugate to a prin- 
cipal element E ) (16.1). Hence it suffices to determine the 
characteristic X of a continuous representation of U for such 
a principal element. E goes over into a conjugate transforma- 
tion within u on permuting the e, : hence X is a continuous 
symmetric function of the angles to, and is of period 2n in each 
of them . 

2. Arithmetic Properties. — The principal elements constitute 
an Abelian sub-group of u ; on compounding two such elements 
E f E' the angles w P) w f v are added. The normal co-ordinates 
y k in representation space 9ft can therefore be chosen in such a 
way that the principal elements correspond to principal trans- 
formations 

B: y*-+p*y*i 
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indeed, we have shown in I, § 5, that any commutative system 
of unitary correspondences can be brought simultaneously into 
diagonal form. On compounding two principal elements the 
condition that £ be a representation is expressed by the functional 
equation . 

/>(«!, o> 2 , • • -)pK, <4 • • •) = pK + <4 <"2 + • • •) 

for each of the multipliers p = p*. The auxiliary theorem then 
tells us that each p k is of the form 

e (h x a*\ + • * * + h n a) n ), 


where the constants h are integers. The characteristic of the 
representation is the sum of these p k ; hence X is a finite Fourier 
series in the arguments o> with integral non-negative coefficients . 
The “ weights ” of a representation are the sets of exponents 
fa K * • •, K) of each term 


£ i 1 8 


^{hi^i + h$oy% + • • * + h n co n ) 

which actually appears in X. The term fa h 2 , • * K) is said 
to be “ higher” than fa hi • h' n ) if the first non-vanishing 

difference h x — h[, h 2 *— h%, • • is positive. 

3. Orthogonality. — For all primitive characteristics 


X the 


integral 


2 n 2* 


( • • ■ jXXAArfa., • • • da) n 
o o 


must have the value 

2n 2n 

V = j • • • jAAiaii • • • du>„. (16-3) 

0 0 

These orthogonality relations suggest that we introduce the 
quantities f = A • X in place of the characteristics X ; they 
are also finite Fourier series, but they are antisymmetric functions 
of the angles ai instead of symmetric ones. h lt h 2 , • • •, K being 
integers arranged in decreasing order 

h 1 >h i >- ■ • > K, ( 16 - 4 ) 

we construct the “ elemental sum ” 

£{K K • ■ -,K) — 2 ± + h 2 co a + • • • + ( 16 - 5 ) 

i.e. the alternating sum over the permutations of the arguments 
Co- the term which we have written down is the highest one 
in the sum. Every alternating Fourier series is a linear aggregate 
of such elemental sums ; since the coefficients of these sums are 
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integers, and in particular that of the u highest ” term is 1, 
every alternating Fourier series, such as f, with integral co- 
efficients can be expressed as a linear aggregate of the form 

! = c-£(h lt h t> - • ■) + c'^(h' 1 ,h' 2 r ••)+••■ (16.6) 

with integral coefficients c, c\ • • •. Let this expansion be 
arranged in decreasing order, i.e. in such a way that the set 
[h^ h 2 , • • •) of exponents is higher than ( Ji h ti 2 , • • •), etc. ; 
[hi, h 2 , ■ • •) is then the highest term in £. A is itself an elemental 
sum, namely 

A = f(» - 1, n — 2, • • •, 1, 0). 

Hence if the highest term in' X has exponents f x , / 2 , • • •, we have 
ki = A+(n— 1 ). * • •, K-i = fn-i + 1 , K = /« ! (! 6 . 7 ) 

in the following the numbers and h { are always in the relation 
(16.7) with one another . 

We denote integration with respect to all the angles of 
rotation from 0 to 2n by a single integral sign and write da) 
for da>i<la> 2 • * • dw n . We now calculate 

\1{K K • * -)£{K K, • • • ) dw ; 

the h and the h' are arranged in decreasing order in accordance 
with (16.4). Consequently no permutation of the h can coincide 
with a permutation of the h ' unless 

K = hi, h 2 = ti 2 , - * •, h n = ti n ; (16.8) 

the integral of each of the (n ! ) 2 terms in the product 

I(^i, ^2 j * • *) £{hi, h 2) • • ■) 

is therefore 0 unless (16.8) holds. In this latter case those n ! 
terms, for which the permutation of the h is the same as that 
of the h ! , each contribute (27r) n to the integral and all others 
contribute 0 ; hence 

K • • -mi K • • -)d<o = 

according as (16.8) holds or not. Applying this in particular 
to the elemental sum A, we find 

jAAdco = V — n ! (2t r)» 

On setting the expansion (16.6) in the equation 

den = V 


n ! (27r)" 
0 
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we find \c\ 2 + |c'| 2 + ■•• = !. Since the c, c', • • • are non- 
vanishing integers only the first term can appear in (16.6). and 
we must have c — 1 or — 1, and since the coefficient of the 
highest term of £ (as of X) must be positive we are restricted to 
the first alternative c = 1. We have thus shown that every 
primitive characteristic is of the form 


1 

N 

1! 

>< 

s\ e A », • • •, shn | 

A ~ | 

Is- 1 , • • ■, 8, 1| 


where the h t are integers arranged in decreasing order : h t >h 2 > • ■ 
The function defined by (16.9) is a finite Fourier series with 
the highest term (/ x , / 2 , • • •, /„) ; the coefficient of this term, its 
multiplicity, is 1. 

4. Completeness. — The last question to be answered asks 
whether every function of the form (16.9) is conversely the 
characteristic of some irreducible representation of u or not. 
Our explicit algebraic construction allows us to answer this 
question in the affirmative. To show this we first note that the 
representation of order / arising from the symmetry pattern 
with (at most n) rows of lengths / x , /*•■•,/„ has as highest 
weight (/j, f S) ■ • •, /„) ; this can be seen immediately by con- 
sidering the representation as generated by alternation from 
the product of n vectors, the first of which occurs f x times as 
a factor, the second / 2 , etc. (as in the simple case at the beginning 
of § 15). The / are here any integers satisfying the conditions 

A £/*£'• • — fn 2= 0. 


On dividing the transformation corresponding to the arbitrary 
element 5 of u in this representation by the Z th power of the 
determinant of 5 ( l being any fixed non-negative integer) the 
highest weight of the resulting transformation is (f x — l, 
/a — l, • • *,/„ — /); this simple device thus enables us to dis- 
pense with the restriction f n Sg 0. We have thus proved that 
all irreducible unitary representations of the unitary group u„ 
are obtainable by completely reducing the representations (u .)/ for 
f — 0, 1, 2, • • • into their irreducible constituents and multiplying 
by the 1-dimensional representations 

S (det. S) 1 [1 = 0, ± 1 , ±2, • • •]. 

We have further shown that the characteristic of the irreducible 
representation § — §(/,, / 2 , • • •, /„) of order f of u, which is gener- 
ated by the symmetry pattern P(f h /*,••■, /„), is given by equation 

(16-9). 
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We could also have obtained this last result with the more 
transcendental method of proof employed in steps 1 to 3. If 
we are operating in the continuum of all complex numbers 
rather than an arbitrary field the proof of the completeness of 
the irreducible representations of a finite group can be formulated 
in such a way that it can be taken over immediately for the case 
of a closed continuous group with the aid of the theory of integral 
equations. The particular application of this general group- 
theoretic completeness theorem to the group b 2 of rotations of 
a circle into itself yields the completeness of the Fourier orthog- 
onal system (m = 0, ±1, ± 2, • • •). Its application to 
the closed group u* yields the following two facts : (I) Every 
expression of the form (16.9) is in fact a primitive characteristic. 
For if it were not it would be a non-vanishing function of position 
on the group manifold — in fact, a class function — whose Fourier 
coefficient with respect to each irreducible representation 
vanishes ; it is indeed orthogonal to all other functions of the 
form (16.9). (2) We further find that the functions (16.9) 

constitute a complete set of orthogonal functions for symmetric 
periodic functions of a> x , co 2i • • •, <o n ; this result is of no particular 
interest, as it is a consequence of the completeness of Fourier’s 
orthogonal system in one dimension. Our general considerations 
(1) to (4) yielded so many properties of primitive characteristics 
that we were able to obtain an explicit expression for them from 
these properties alone. 

Consequences. — The assumption that h n = f n ^ 0 constitutes 
no actual restriction ; the characteristic is then a symmetric 
rational integral function of the s of order /. The e are in fact 
roots of the characteristic polynomial /(r) = det (rl — S) of 
the unitary transformation 5 ; it is therefore possible to express 
X rationally and integrally in terms of the coefficients of this 
polynomial, and therefore in terms of the coefficients of the 
matrix S. The restriction to the unitary group can then readily 
be removed, but we shall not go further into these considerations 
here. 22 

The dimensionality of the representation X is found by 
calculating X for the unit element, all of whose characteristic 
numbers e, are 1. On substituting directly in (16.9) we obtain 
the indeterminate form 0/0, so we proceed as follows. Take 
coi = {n — l)o>, o >2 — {yi — 2)o) } • • •, oo n = 0 co 

in terms of the single angle o>. The determinant in the numerator 
of (16.9) is then the alternating sum of the terms obtained from 
the product 

e{h x {n — l)a>) • e(h 2 {n — 2)a>) • • • e(h n 0<o) 
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by permutations of the numbers n — 1 , n — 2, • • *, 0 ; it is 
therefore equal to 

IMM}"" 1 , • • •. MM) 1 , i| 

or to the product of the differences of the expressions e(h 1 a}) ) 
e(h 2 a>), • * * obtained by subtracting any member of the set from 
any of the earlier members. On allowing w->0we have 

efoid)) — e(h 2 o>) ^ ioo{h x — h 2 ). 

The dimensionality N of the representation denoted by 
§(/i> A * * *, fn) i n the above is consequently 

(16.10) 

Evaluation of the Characters of 777 . — Having obtained explicit 
expressions for the characteristics of the representations of u n 
we now employ the connection between the representations of 
777 and lt n developed in § 9 to evaluate the primitive characters 
of 777 . In equation (9.12) x is the character and X the char- 
acteristic of the irreducible representations of 777 and U n , re- 
spectively, generated by the symmetry pattern P(f 1} / 2 , ••'•); 
in particular we must put X = 0 if the pattern has more than 
n rows. The sum is extended over all possible symmetry 
patterns P with /fields. The expression (16.9) for X then allows 
us to enunciate the following rule for the evaluation of x : Let 

Xhft ■ ■ • (h* - . • • •) (16.11) 

denote the value of the character of the irreducible representation 
^)(/ij f%i * • *) of 7Tf> which is generated by the symmetry pattern 

P(f lf / a , * * •), for an element 5 belonging to the class I = (t\i 2 • • •). 

Choose an arbitrary positive integer n and construct the sums 
o r 2 , * " * of powers of n independent variables e lf s 2 , • • *, e n and 
the product D(z X) s 2 , • * *, e n ) of their differences . The term (16.11) 
is then the coefficient of the term * * * s£ n [hi = fi + {n — i)] 
in the expansion of 

D( Sj, e a , ■ • •, £„) • cMa' ‘ ‘ ’• (16.12) 

We here assume that the pattern P has at most n rows ; hence 

if we wish to obtain all primitive characters of 7 77 we must choose 
n ^/. The rule shows that the components of the characters 
are integers. 

This result was obtained by Frobenius in a purely algebraic 
manner, without introducing the continuous group u . 23 But 


DjK K • • •, K) 
Dip - 1 , • • •, 1 , 0 ) 
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I believe that the real reason for the rule comes to light only 
when we consider this connection between the groups ir t and 
U„ — in particular, it enables us to understand why a second 
integer n in addition to / is involved. 

The dimensionality g of lj(/i, /*, • • •) is obtained by substitut- 
ing the argument .? = I ; H — f, H = i$ = • • • = 0 in the 
character x ■ Formula (9.12) is then 

o{ = *X, 


where the sum is extended over all patterns P(J\, f 2 , • • •)■ Since 
a x is the characteristic of the w-dimensional representation 
C ; 5 -> 5 of the group u by itself, this merely means that in 
the complete reduction of (c)f the irreducible representation 
• • •) appears exactly g times, as we already know. 
On substituting the explicit expression (16.9) for X we obtain 

a{- |s n— 1 , • • •, e, 1 1 = Eg • Is* 1 , s A ‘, • • *, s A "|. 

g is accordingly equal to the coefficient of e* 1 s^ 4 • * * in the 
expansion of the product on the left-hand side. The term 
± s^sl 1 • • • £*” in the expansion of the determinant must 
be multiplied by the term 


n 

(K-kji (h 2 -k 2 )\ 


-ei 1 "* 1 el 1 "** * • • 


of <t{ in order to obtain a contribution to the term e^el* • • • 
of the product. (k u k 2) • • •, k n ) here run through the per- 
mutations of n — 1, • • *, 1, 0 and g is accordingly equal to the 
alternating sum 

/! 2 ± (*1-*JI {K - h) l • • • 

<*) 

over these permutations, or equal to the determinant 


/! 


(h-n -f- 1) !’ 


/! 


^1 h 2 \ 


1 1 I 

(h - 1)!’ hi ! 

- h(h — 1) • • • (h — n + 2), • • •, h, l|. 


The rows of this determinant consist, on reading from right to 
left, of polynomials in h of degrees 0, 1, • • •, (n — 1) with highest 
coefficient 1. The determinant is therefore 
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and we finally obtain the simple formula 


_ /! Djhij h 2j 
g h x ! h 2 ! • 



(16.13) 


n is to be taken at least as large as the number of rows in the 
pattern P(f Xi / 2 , • * *) ; the reader should convince himself by 
direct calculation that the value of (16.13) remains unchanged 
on replacing n by n + 1. 

Frobenius’ rule for the character and this formula for the 
dimensionality are vastly superior to (14.7) for purposes of 
practical evaluation. 

As an example, we carry through the computations for the 
case of four electrons ; the results are given in the table below. 
The group 7 r 4 contains twenty-four elements which are divided 
into five classes of conjugates ; each of these classes is designated 
in the second column of the table by the values (i x i 2 • • *) as- 
sociated with it. The first column contains the number of 
elements in each of these classes, and the sign + or — indicates 
whether the class consists of even or odd permutations. Each 
of the five remaining columns contains the values of a primitive 
character for the classes in whose row they stand. The symmetry 
pattern to which each of these characters belongs is indicated at 
the head of the column by the numbers f 1} / 2 , * * * of elements in 
its rows. The first and the last of these columns may be filled in 
immediately, and the second and third with the aid of Frobenius’ 
rule. The fourth is then obtained from the second on noting 
that its symmetry pattern is the dual of that of 2 ; we need 
then merely to replace the values in the second column by their 
negative for the (-) -classes. Since patterns 2 and 3 contain 
but two rows we may take n = 2. Hence on writing x ) y in 
place of e x , s 2 we have merely to find the coefficients of x*y (for 
the column 31) and x z y 2 (for the column 22) in the following 
polynomials : 


(x - y)(x + y) 4 , 

(x — y)(x -f- y) 2 (A 2 + y 2 ) = [x + y) [x 1 — y a ) {x 2 + y 2 ) 

= [x 4- y)( x 1 — y 4 ) 


(x — y)(x 2 4- y 2 ) 2 , 

(x — y)(x + y)(x 3 + y 3 ) = ( * 2 — y 2 )(* 3 + y 3 ), 
(x - y)(x 4 + y 4 ). 


The dimensionalities of the five irreducible representations are 
contained in the first row ; they are 1, 3, 2, 3, 1. The verification 
of the orthogonality relations is left to the reader. 
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No. 

Elements. 

-XPattern. 






ClassN^ 

4 

31 

22 

211 

mi 

1 + 

4 

1 

3 

2 

3 

l 

6- 

21 

1 

1 

0 

-1 

-l 

S+- 

02 

1 

-1 

2 

-1 

l 

8-b 

101 

1 

0 

-1 

0 

i 

6- 

0001 

1 

-1 

0 

1 

-l 


§ 17, Calculation of Volume on u 

Consider the line elements going out from the unit point I 
on the group manifold U, i.e. the infinitesimal unitary trans- 
formations 8 S = \\hsxfi\l We may take as the real components 

of this “ vector ” the n quantities \ . 8 and the real and 

i 

imaginary parts of the n(n — l)/2 quantities 8s<x 0 (a </3) ; the 
total number of components is thus n 2 , which is therefore the 
dimensionality of the group manifold U. Now in a linear algebra 
of this kind we may replace any two real quantities a, b by the 
complex quantities a + ib, — a ■+ ib obtained from them by 
a simple linear substitution ; we may therefore replace the 
real and imaginary parts of 8s a0 (a < fi) by 8 s a0 itself and 

— S-Soc/j = SSpot. 

On transporting such an infinitesimal vector to the point 
5 on the group manifold by a left-translation its terminus goes 
into the point 5 + dS = 5(1 + §5), dS = 5 • 85 ; we must 
therefore consider the infinitesimal element 85 = S^dS as the 
41 vector” which leads from 5 to 5 + dS. Our definition of 
volume on the group manifold [III, § 12] consisted in the 
following : the parallelepiped defined by n 2 vectors 85 leading 
from the fixed point 5 to the neighbouring points 5 -f- dS has 
as volume the absolute value of the determinant formed from the 
components of the n 2 vectors 85. In accordance with the above 
remarks we may take as components of the vector 8 S = || 8j a/ j|| 
the totality of coefficients 8% themselves. 

Any 5 can be expressed in the form 

5 = UEU- 1 (17.1) 

where £ is a principal (diagonal) element of U and U is unitary. 
5 is unchanged on multiplying U on the right by any principal 
element. We employ a geometrical terminology which will 
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allow us to visualize our procedure by means of an analogy. 
Two elements 77, 77' of u which are right-equivalent with respect 
to the group of principal elements : 77' — UE ) will be said to 
“ lie on the same vertical [77].” From the w 2 -dimensional mani- 
fold U we obtain by projection the (n 2 — n) -dimensional mani- 
fold [it] of verticals [77] on considering all points of U which 
belong to the same vertical to be coincident. This process of 
identifying equivalent elements was described in general in the 
beginning of Chapter III — we had, in fact, already met it in I, 
§ 1, in the special case of projection in affine space. We may now 
consider 77 in (17.1) merely as a representative element of the 
vertical [77] ; on allowing [77] to run through the entire mani- 
fold [u] and the angles co„ of E : 





to vary independently over the complete range 0 ?£ co < 2rr 
the element S defined by (17.1) describes the manifold u exactly 
n ! times. 

The vector 817= 77~ 1 i77 leads from the point 77 of the vertical 
[U] to the neighbouring point 77 + dU of the vertical [77 + dU ]. 
The totality of all points on [77 + dU) which are in the neigh- 
bourhood of U is given by expressions of the form 

(77 + dU){ 1 + 8E) = 77 + (dU + U SE) 

where 8 E is an arbitrary infinitesimal principal element with 
coefficients i 8<o„ on the principal diagonal ; the corresponding 
vectors are §77 = 8 [7 + 8 E. Since the terms in the. principal 
diagonal of 877 are pure imaginary, E may be uniquely deter- 
mined in such a way that all terms in the principal diagonal of 
877 vanish; we call this transition from [77] to [77 + dU] the 
horizontal transition from 77.” — The transition from some other 
point UE of the vertical [17] to the point (77 + dU)E of [77 + dU] 
is accomplished by means of the vector 


S' 77 = E" 1 * 8U • E. s (17.2) 

That this linear transformation (17.2) determined by E , which 
sends 877 into 8' 77, is unimodular follows from our general re- 
marks concerning closed continuous groups — and can in this 
case be readily verified by direct computation. Naturally this 
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same equation holds for the horizontal transitions S[7, &'U from 
U, UE respectively : 

8 f U = E- 1 'hU-E. (17.3) 

n 2 — n horizontal vectors 8 U leading out from U determine an 
infinitesimal “ parallelogram ” whose content is measured by 
the absolute value of the determinant of the n 2 — n components 
8u afi (a =4= jS) of the various vectors SU. On allowing each point 
U on the periphery of the parallelogram to describe the vertical 
\U] we obtain a tube whose horizontal sections are parallelo- 
grams ; its projection on [u] is the original element of volume, 
the “ parallelogram ” defined by the 8 U. Since the linear 
transformation (17.3), tU ->■ 8' U, is unimodular, the content of 
each horizontal section is the same, and may therefore be con- 
sidered as the content of the volume element on [u]. 

We now examine the variations in [U] and E in (17.1) when 
5 goes over into 5 + dS. We have 

517 = UE 

and therefore 

dS ■ U + 5 • dU = dU • E + U • dE. 

On multiplying both sides of this equation by = E^U" 1 

we find 

Z7” 1 • 85 • 17 + 8 [7 = E~ x ■ 8C7 • £ + 8£ 
or 

VS ee t/- 1 - 85 ■ U = {E- 1 -8U-E-W} + 8E. (17.4) 
The components of the matrix contained in parentheses are 

x )- 

We now define a parallelepiped at S which shall serve as a 
volume element in the following manner : n 2 — n of the n 2 
sides 85 are obtained from (17.4) on allowing the angles of 
rotation to remain fixed, i.e. 823 = 0, and drawing n 2 — n hori- 
zontal vectors 8U from the point U to form a volume element 
of magnitude d[U] on [u] ; the remaining n vectors 85 are then 
chosen in such a way that for each of them one and only one of 
the angles co r changes by dw r and [U] remains unchanged. The 
corresponding n 2 vectors 8'5 define, in accordance with (17.4), 
an element of volume of magnitude 
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Since the linear transformation 85 -> 8 '5 = U^ 1 • 85 • 17 is uni- 
modular this volume is equal to that of the element defined by 
the 85 themselves. Since e = 1/e the product II in (17.5) can 
be written 

= I f (e« - *,)(5« - !„) = A • A. 

0<J> 

The final result is : The volume element described by 5 on allowing 
[U\ in (17.1) to describe an infinitesimal volume element of mag- 
nitude d[U ] on [it] and on allowing the angles of rotation w v to vary 
by da) v has the magnitude 

AKdco l dco 2 * • • da) n *d[U]. (17.6) 

On integrating with respect to d[U] over [u] we obtain the 
theorem, already v applied in the preceding section, concerning 
the magnitude of that portion of U in which the angles of rotation 
have values lying between co v and <o v + dco v . 

These considerations remain valid on restricting ourselves 
to the group u of unitary transformations with determinant 1. 
The angles of rotation are then subjected to the restriction 

«>i + <*>2 + * • • + w n = o, (17.7) 

and the only difference in the result is that the factor da> n in 
(17.6) is to be omitted. Condition (17.7) allows us to normalize 
the linear form h t co* + • • • + h n ou n in the angles of rotation in 
such a way that h n = 0 ; the exponents (h lt h 2} • • •, h n ) in the 
weights of the representations of u are then non-negative integers. 
It is desirable, however, not to impose this normalization h n = 0 ; 
we need then only to remark that only the differences between 
the hi are of significance : the irreducible representations 
&(A> /a, • * *, fn) of U are unchanged on increasing each of the fi 
by the same integer. In particular, these considerations justify 
the expression used in Chapter III for the volume on the group 
manifold of the unimodular unitary group li 2 , and the results 
of the preceding section constitute a direct proof, which is inde- 
pendent of the completeness theorem, of the fact that the 
representations of U 2 denoted by constitute a complete set of 
inequivalent irreducible representations of u 2 . 



390 THE SYMMETRIC PERMUTATION GROUP 


§ 18. Branching Laws 

Finally, we show the usefulness of our formulae for the 
characters by deriving two simple “ branching laws ” from them. 

1. Branching law for the Permutation Group . 

The irreducible representation of 7T f with the symmetry pattern 
P(fi, f 2) • * •) reduces , on restricting rr f to the sub-group 7r/_ 3 of 
permutations of f — 1 things , into the sum of those irreducible 
representations of associated with the patterns 

P(fi- !,/*/*•• •); 

P(fi,A - • •); 


those patterns in which the rows are not arranged in decreasing 
length are to be omitted. Each such constituent appears exactly 
once. (In words, these patterns are obtained from the original 
one by removing a field in turn from the end of each row which 
is actually longer than the following one.) 

Proof Let s be a permutation of the numbers 1, 2, • • •, 
/— 1 belonging to the class (i 3 — 1, i 2) z 3 , • * •). Considered as a 
permutation of the/numbers 1, 2, • • -,/, s leaves the last number 
fixed ; the number of one- term cycles is thus increased by 1, 
and s, considered as an element of belongs to the class 
(i h i 2) i 3 , • • •). In the expansion 

A • ej' 1 e 2‘ • ■ • (18.1) 

we have as the coefficients of those terms for which 
hi ^ h 2 • • 

aw,... = 0 or Xrvv^) (18-2) 

according as any of the signs ^ in the above inequalities is 
actually = or not. xr * s the primitive character of belong- 
ing to the symmetry pattern P(/l, f 2) • • •). On the other hand, 
the coefficient of ej 1 e$ • • • [h x > h 2 > • • •] in A • o l £ • • • is 
equal to the character Xa/i*v( 5 ) °f the representation of 777 
with pattern P(f 1) f 2) • • •). Hence on multiplying (18.1) with 
a i — e i + s 2 + • ‘ * + £« we find 

Xfift • • • i S ) = ^x-l, A*, • • • + &h Lt 7i t - 1, A #> • • • + * • •. 

Our branching law follows from this result and (18.2). The 
branching law leads to a recurrence formula for the dimension- 
alities g(/ a , / 2 , • • ■). 
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2. Branching law for c n . 

On restricting t n to the sub-group of linear transformations of 
an (n — \) -dimensional sub- space the irreducible representation 
(fir fzt • * *) of c n reduces into the sum of all those representations 
(fi fi • • •) of Cn-i for which 

fl ^fl £/. ^ • • • ^fn- 1 — in ; (18.3) 

each of these constituents appears exactly once. 

Proof. The linear transformations S of the sub-space c n _ x : 
x n == 0 are simply isomorphic to those linear transformations 
5* of the variables x X) x 2 , * • x n in which -> x n . Hence e n 
is to be replaced by 1 in the characteristic (16.9). The denom- 
inator is then 


^ 2 ; *> ^ n — l ) * (®1 1 ) (^2 1 ) (®»»— 1 1 )) 


as can be seen by subtracting the last column of D( e h e 2 > * * *, 
e n~x» 1) from each t ^ ie previous ones and factoring the resulting 
(n — l)-row determinant. In order to divide the determinant 
in the numerator by the factor (e x — l)(e 2 — 1) • - • (£„_* — 1) 
we subtract the second column from the first, the third from the 
second, • • *, and finally the n th from the (n — l) 8t . The last 
row then is 0, 0, • • •, 0, 1 ; the determinant is thus reduced to 
a determinant of order (n — 1). Now divide each element in 
the row by e v — 1 in accordance with 


6^1 


i 


+ • • • + e A *. 


The result is that we then have in the numerator the determinant 
[ e V-i 4- e A « _1 + • • • + eS • • -| 

(s — £ l! ^2, ■ "t 

But this is the sum of all (n — l)-rowed determinants of the form 

hi > h[ ^ h 2 > h' 2 sg h 3 > • ■ ■ > Ki-i ^ h n (18.4) 

On subtracting n — 1 from h lt n — 2 from h. i and h 2 , • • ■, 0 
from h' n _i and h n , in order to obtain the numbers f [(16.7)], che 
inequalities (18.4) become the inequalities (18.3) and our theorem 
is proved. 




APPENDIX 1 

Proof of an Inequality 

(Page 77.) 

In order to prove the inequality stated on page 77 we must 
show that any continuous and differentiable function i (j, which 
is defined for all values of the real variable x, satisfies the 
condition 

j(}«*)'s Jff*, (*) 

— 00 — 00 — CO 

provided, of course, that the integrals involved actually exist. 
The Schwarz inequality 

\ a l^l + ' * * + a J>n | 2 =a ( a l&l + • * * + Q'rfl'n) (^iPl + * * * + &n&n) 

employed in Chapter I becomes, on replacing the sums by in- 
tegrals — or rather each sum by two integrals — 

| \figJx + \f i g i dx\ i ^ {\fjidx + \f t f'fix)[\g&dx + 

Applying this inequality to 

4 (W > 

by taking 

fi = x*,f» = xf, Si=^, gt=% 

and transforming the integral 

into — ^ipijjdx 

by partial integration over the range —oo, +oo, we obtain the 
desired relation (*) provided the term which is integrated 
out, approaches 0 as x-> ± oo. That this is actually the case 
if the two integrals on the right of (*) converge can be seen by 
the following indirect proof. Let e be any pre-assigned positive 
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constant and consider a positive value of x for which x \ |* > e 

00 

and which is so large that jj^r I dx g The Schwarz inequality 




then tells us that for x ^ x' ^ x + - 

x 


2 

dx 


I </-(*') - #*01* whence |##')| S |#*)| - 


The integral of x* U | 8 over the range from x to x -f- - is then 

X 

„ 1 s e e a 

> %* ‘ -7 ~ ~ = T- 

4 x x 4 

Hence it follows that conversely 


00 00 



X X 


imply the inequality 


*|#*)|* £e. 



APPENDIX 2 

A Composition Property of Group Characters 

{Page 169.) 

The fundamental property of the irreducible representation 
^ ^ -*■ U{s) which is expressed in the equation 

U(st) = U(s)U(t) 
is paralleled by the relation 

x( s )x(t) = lZx(sr-Hr). (*) 

Proof If x, jk are two elements of the algebra of the group, 
the second of which belongs to the central, and if 

x -> X y y -> Y in 

then Y = - 1 . The matrix associated with z = xy in § is 

^-X and its trace is — : 

£ £ 

r*(r)xw = J zx(s)x(s) • ry(<)xW- 

r g s t 

On setting 

z(r) = Zx(s)y(t) (st = r) 

we find 

Zx{s)y(t) x (st) = ^E#(*)y(*)x(*)xW- 
* S *,t 

Since y(£) depends only on the class of conjugate elements to 
which t belongs we may replace 

X{st) by IZxisr-Hr) 

on the left-hand side of the previous equation. Then the co- 
efficient of x(s)y(t) on either side of the equation depends only 
on the class to which the element t belongs, and since x(s) is an 
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arbitrary function, y(t) an arbitrary class function, the assertion 
(*). follows from the fact that the two coefficients must agree. 

We have omitted mention of this equation (*) in the text 
in order not to interrupt the systematic development of the 
theory of representations, which is completely described by the 
orthogonality relations and the completeness theorem. 



APPENDIX 3 


A Theorem Concerning Non-degenerate Anti- 
symmetric Bilinear Forms 

{Page 274.) 

We consider the given non-degenerate anti-symmetric bi-linear 
form 

f 

ZCikXiJk {c ki = — c ik ) 

t, k = 1 

as the “anti-symmetric product” [jt)] of the two vectors 
j = (#j, x 2} * * *, x f) and = (y^)- Let e x be any non-vanishing 
vector ; then by assumption [e x j] cannot vanish identically in 
j, and consequently a second vector e 2 can be found such that 
[e x e 2 ] = 1. The simultaneous equations 

[<h£] = 0, [e 2 j] = 0 

then have/ — 2 linearly independent solutions e 3> * * e f . These 
vectors arc furthermore such that no linear dependence can 
exist between them and e 1; e 2 , for if 

E = + £ 2 e 2 + Zip* + • • • + f/C/ = 0, 

it follows on building the anti-symmetric products [e x j] = £ 2 , 
[e 2 j] = — ■ that f 2 = 0. We may therefore choose 

e i, e 2 ) * * *, 6/ as a co-ordinate system, i.e. as a basis from which 
all vectors may be constructed. Let the anti-symmetric pro- 
duct be expressed in terms of the components rj k of j, 1) in 
this new co-ordinate system by 

f 

fety] = ZviktiVk- 

i, k - 1 

The manner in which the new fundamental vectors were deter- 
mined requires that of the coefficients y ik = [e t -e fc ] 

Yu = y« = 1 ; yw = 0, • • •, y lf — o, 

Yn = — 1, y 22 = 0 ; y 23 = 0, • • •, y v = 0. 
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In consequence of the anti-symmetry all y ix , y ia with i = 3, • • •,/ 
vanish, and the matrix of the y ia is completely reduced into the 
2-rowed square sub-matrix 

0 1 
-1 0 

and an (f— 2)-dimensional anti-symmetric matrix. Mathe- 
matical induction with respect to the dimensionality / yields the 
desired theorem that / is necessarily even and that the original 
form can be transformed into 

(£i7s — + (&?* - ZiVz) + * • ' (//2 terms) 

by an appropriate linear transformation. 
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30, 641 (1929), in particular §§ 3 and 16. The uniqueness of com- 
plete reduction rather than reduction follows in general W. Krull, 
Math. Zeits. 23, 161 (1925) ; O. Schmidt, Math. Zeits. 29, 34 
(1928) ; R. Brauer and I. Schur, Sitzungsber. Preuss. Akad. 1930, 
209. 


(5) 152. Schur's treatment of the theory of representations, cited in 

( 3 ) is based on this lemma. 

(6) 153. W. Burnside, Proc. Lond. Math. Soc. (2), 3, 430 (1905). 

(7) 156. G. Frobenius and I. Schur, Sitzungsber. Preuss. Akad. 

1906, 209. 

(8) 161. The method of integration over the group manifold is due to 

A. Hurwitz, Gott. Nachr. 1S97, 71, although it was applied by him 
to the theory of invariants rather than to the theory of groups. 
I. Schur first obtained the orthogonality properties of the 
characteristics of the continuous rotation group in this way and 
used them to prove the completeness of the system of known 
representations : Sitzungsber. Preuss. Akad. 1924, 189, 297, and 
346. 

(9) 166. For a modern book on algebra see L. E. Dickson, Algebras 

and their Arithmetics (Chicago 1923) ; the German edition, 
Algebren und ihre Zahlentheorie (trans. by J. J. Burckhardt and 
E. Schubarth, Zurich 1927), follows an author’s revision which has 
not appeared in English. Also B. L. van der Waerden, Modeme 
Algebra II (Berlin 1931). An algebra was previously called a 
<4 system of hyper-complex numbers,’ ' and is at present to some 
extent in the German literature ; the algebra of a group is there 
referred to as a " Gruppenring.” The usual procedure in modern 
algebra consists in reducing the algebra into simple matric 
algebras, in which case the theorems on realization by linear trans- 
formations appear as corollaries ; this development will be followed 


in Chap. V. 

(10) 173. See R. Weitzenbock, Invariantentheorie (Groningen 1923). 
The foundation for the proof of the fundamental theorem of the 
theory of invariants is the Hilbert basis theorem : D. Hilbert, 
Math. Ann. 36, 473 (1890). The author has shown (Math. Zeits. 
24, 392, 1926) that the fundamental theorem is valid for any closed 
and for any semi-simple continuous group. The older theory of 
invariants was almost exclusively concerned with the group c« 
of all linear transformations with unit determinant. A really 
modem book on the theory of invariants is lacking. 

(11) 175. The theory has been presented by S. Lie himself, with the 
assistance of F. Engel, in a huge three-volume work : Theorie der 
Transformationsgruppen (Leipsic 1893, 1930). See also S. Lie 
V orlesungen iiber kontinuierhche Gruppen, ed. by G. ocheffers 
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(4) 194. Appropriate methods for carrying through the perturbation 
calculations (method of the " self-consistent field ”) have been 
developed by : D. R. Hartree, Proc. Cambr. Phil. Soc. 24, 89 
(1928). J. A. Gaunt, Proc. Cambr. Phil. Soc. 24, 328 (1928). 
Also J. C. Slater, Phys. Rev. 32, 339 (1928) ; 34, 1293 (1929) ; 36, 
57 (1930). E. U. Condon, Phys. Rev. 36, 1121 (1930) ; E. U. 
Condon and G. H. Shortley, Phys. Rev. 37, 1025 (1931). V. 
Fock, Zeits. f. Phys. 61, 126 ; 62, 795 (1930). G. Breit, Phys. 
Rev. 35, 569 ; 36, 383 (1930). W. Heitler and G. Rumer, Zeits. 
f. Phys. 68 , 12 (1931). 

(5) 201. See the report by H. Honl, Ann. d. Phys. (4) 79, 273 (1926). 
For a derivation of the formulae on quantum, mechanics, although 
not from the group-theoretic standpoint, see M. Born, W. Heisen- 
berg and P. Jordan, Zeits. f. Phys. 35, 557 (1926). Also in Chap. 
IV of Born and Jordan, Elementare Quantenmechanik. 

(6) 203. W. Pauli, Zeits. f. Phys. 43, 601 (1927). 

(7) 203. G. E. Uhlenbeck and S. Goudsmit, Naturwiss. 13, 953 

(1925) ; Nature 117, 264 (1926). 

(8) 205. O. Richardson, Phys. Rev. 26, 248(1908). A. Einstein and. 
W. J. de Haas, Verhandl. d. Deutsch. Phys. Ges. 17, 152 (1915) ; 
18, 173 (1916). E. Beck, Ann. d. Phys. (4), 60, 109 (1919). 
S. J. and L. J. H. Barnett, Phys. Rev. 17, 404 (1921). A. P. 
Chattock and L. F. Bater, Phil. Trans. Roy. Soc. 223, 287 (1922). 

(9) 207. A report on a unified notation for the designation of terms of 
atomic spectra in terms of quantum numbers has been presented 
by H. N. Russell, A. G. Shenstone and L. A. Turner, Phys. 
Rev. 33, 900 (1929). It has also been found necessary to ascribe 
a spin to the atomic nucleus in order to account for the hyper-fine 
structure : E. Back and S. Goudsmit, Zeits. f. Phys. 43, 321 (1927) ; 
47, 174 (1928) ; S. Goudsmit and R. F. Bacher, Phys. Rev. 34, 
1501 (1929) ; S. Goudsmit, Phys. Rev. 37, 663 (1931). J. 
Hargreaves, Proc. Roy. Soc. 124 (A), 568 (1929). E. Fermi, 
Zeits. f. Phys. 60, 320 (1930). G. Breit, Phys. Rev. 37, 51 (1931). 

(10) 209. E. Back and A. Lande, Zeemaneffekt und Multiplettstruk- 
tur (Berlin 1925) A. Land£, Zeits. f. Phys. 15, 189 (1923). W. 
Pauli, Zeits. f. Phys. 16, 155 ; 20, 371 (1923). A. Land£, Zeits. 
f, Phys. 25, 46 (1924). W. Heisenberg and P. Jordan, Zeits. f. 
Phys. 37, 263 (1926). K. Darwin, Proc. Roy. Soc. 118 (A), 264 
(1928). For ( jj ) and (si) coupling see J. H. Bartlett, Phys. Rev. 
35, 229 (1930). 

(11) 210. H. Weyl, Math. Zeits. 23, 292 (1925). J. v. Neumann 
and E. Wigner, Phys. Zeits. 30, 467 (1929). 

(12) 210. Proc. Roy. Soc. 117(A), 610; 118, 351 (1928). C. G. 
Darwin, Proc. Rov, Soc. 118 (A), 654 (1928). A. Land£, Zeits. 
f. Phys. 48, 601 (1928) ; in the same volume F. Moglich, 852, 
and J. v. Neumann, 868. V. Fock, Zeits. f. Phys. 55, 127 (1929). 
For the older work concerning the interaction of spin and orbital 
moment of momentum see L. H. Thomas, Nature, 117, 514 (1926) ; 
J. Frenkel, Zeits. f. Phys. 37, 243 (1926) ; W. Heisenberg and 
P. Jordan in the same volume, 863. 

(13) 217. P. A. M. Dirac in Quantentheorie und Chemie, Leipziger 
Vortrage, 1928, 83 (Leipsic 1928). 

(14) 220. H. Weyl, Proc. Nat. Acad. Sci. 15, 323 (1929) ; Zeits. f. 
Phys. 56, 330 (1929). V. Fock, Zeits. f. Phys. 57, 261 (1929). 
V. Ambarcumian and D. Ivanenko, C. R. Acad. sc. USSR. 1930, 45. 



BIBLIOGRAPHY 


406 

PAGE 

(15) 224. See Wentzel's report cited in II < 1# ) ; A. Sommerfeld, Wave 
Mechanics ; Born and Jordan, Elementare Quantenmechanik ; O. 
Klein and Y. Nishina, Zeits. f. Phys. 52, 853 (1929). Y. Nishtna, 
same volume, 869. 

(16) 237. A. Sommerfeld, Ann. d. Phys. (4) 51, 1 (1916). For the 
significance of these results for the theory of X-ray spectra see 
Sommerfeld ' s book cited in the introduction. Perturbation 
calculation in the new quantum mechanics, W. Heisenberg and 
P. Jordan, lx. ( 10 ) ; exact derivation by means of the Dirac theory 
of the electron ; W. Gordan, Zeits. f. Phys. 48, 11 (1928) ; C. G. 
Darwin, l.c. ( la ) ; A. Sommerfeld, Wave Mechanics, p. 257 ff. 

(17) 241. W. Heisenberg, Zeits. f. Phys. 38, 411 (1926). Correspond- 
ing energy calculation for He atom ; W. Heisenberg, Zeits. f. 
Phys. 39, 499 (1926). P. A. M. Dirac, Proc. Roy. Soc. 112 (A), 
661 (1926). J. A. Gaunt, Proc. Roy. Soc. 122(A), 513 (1929) ; 
Phil. Trans. Roy. Soc. 228 (A), 151. Y. Sugiura, Zeits. f. Phys. 44, 
190 (1927). W. V. Houston, Phys. Rev. 33, 297 (1929). J. C. 
Slater, Phys. Rev. 32, 349 (1928). G. Breit, Phys. Rev. 34, 
553 (1929); 36, 383 (1930). The " symmetric n sub-space leads 
to the Einstein-Bose statistics, which is discussed in the references 
cited in II (•) above. The statistics arising from the “ anti-sym- 
metric " sub-space was developed by E. Fermi, Zeits. f. Phys. 
36, 902 (1926) and applied by W. Pauli, Zeits. f. Phys. 41, 81 
(1927), to the explanation of paramagnetism and by A. Sommerfeld 
to the electron theory of metals : A. Sommerfeld, W. V. Houston 
and C. Eckart, Zeits. f. Phys. 47, 1 (1928). 

(18) 244. E. C. Stoner, Phil. Mag. (•) 48, 719 (1924). W. Pauli, Zeits. 
f. Phys. 31, 765 (1925). It is to be remembered that this develop- 
ment antedates the new quantum theory and the theory of the 
spinning electron, and that Pauli's introduction of the four 
quantum numbers n , /, j, m demanded a complete re-classification 
of all spectroscopic material. 

(19) 248. P. A. M. Dirac, Proc. Roy. Soc. 114 (A), 243 (1927). On 
taking the interaction of the particles into account : P. Jordan 
and O. Klein, Zeits. f. Phys. 45, 751 (1927). 

(20) 250, 280. P. Jordan and E. Wigner, Zeits. f. Phys. 47, 631 
(1928). 

(21) 253. P. Jordan and W. Pauli, Zeits. f. Phys. 47, 151 (1928). 
G. Mie, Ann. d. Phys. 85, 711 (1928). W. Heisenberg and 
W. Pauli, Zeits. f. Phys. 56, 1 (1929) ; 59, 168 (1930) ; W. Heisen- 
berg, Zeits. f. Phys. 65, 4 (1930) ; Ann. d. Phys. 9, 338 (1931). 
L. Rosenfeld, Zeits. f. Phys. 63, 574 (1930). J. R. Oppenheimer, 
Phys. Rev. 35, 461 (1930). G. Breit, l.c. ( 17 ). E. Fermi, Rend. 
Acc. d. Lincei (6) 9, 181 (1929). L. Landau and R. Peierls, 
Zeits. f. Phys. 62, 188 (1930). L. Rosenfeld, Ann. d. Phvs. (5) 
5, 113 (1930). 

(22) 257, H. Weyl, Joum. f. d. reine u. angew. Math. 141, 163 (1912). 

(23) 261. See P. Jordan, Die Lichtquantenhypothese, in : Ergeb- 
nisse der exacten Wissenschaften, 7, 158 (1928). 

(24) 262. P. A. M. Dirac, Proc. Roy. Soc. 126 (A), 360 (1930) ; Proc. 
Cambr. Phil. Soc., 28, 361 (1930). J. R. Oppenheimer, Phys. 
Rev. 35, 939 (1930) . For a report on this theory see P. A. M. Dirac, 
Nature, 126, 605 (1930). For an attempt to avoid the negative 
energy levels by a reduction of all operators see E. SchrOdinger, 
Sitzungsber. Preuss. Akad. 1931, 63. 
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(25) 264, See articles by Heisenberg-Pault and Rosenfeld cited 
in < 21 ). 

(26) 276. H. Weyl, Zeits. f. Phys. 46, 1 (1927). 

(27) 280. A rigorous proof of these theorems for oo-dimensional space 
has been announced by M. H. Stone, Proc. Nat. Acad. Sci. 16, 
172 (1930) ; J. v. Neumann informs me in a recent letter that he 
has also obtained a proof of this theorem. 


CHAPTER V 

(1) 284. The transition from the group T 0 to the algebra 2 } which is 
suggested by quantum mechanics, has also improved the theory 
from the purely mathematical standpoint ; see H. Weyl, Ann. 
of Math. (2) 30, 499 (1929). The connection between the repre- 
sentations of u n or C n and 7T f was first clearly seen by I. Schur in 
his Dissertation (Berlin 1901). Further see : H. Weyl, Math. 
Zeits. 23, 271 (1925) ; I. Schur, Sitzungsber. Preuss. Akad. 1927, 
58 ; 1928, 100. On the symmetry classes of tensors see : A. Young, 
Proc, Lond. Math. Soc. 33, 97 (1900) ; 34, 361 (1901). H. Weyl, 
Rend. Circ. Mat. Palermo, 48, 29 (1924). 

(2) 287. This has been emphasized by P. A. M. Dirac, Proc. Roy. 

Soc. 123 (A), 714 (1929). 

(3) 291. G. Frobenius used the term “ characteristic unit M for 
this concept (see Sitzungsber. Preuss. Akad. 1903, 328), and this 
name has been taken over into the physical literature. But in the 
meantime the term " idempotent ” has been used in systematic 
investigations on algebras. The notions of “ right- and left-invari- 
ant sub-algebra ” and " left-invariant sub-algebra ” correspond 
with those of " ideal ” and “ left-ideal " in arithmetic when all the 
elements of the algebra are considered as “ integers/" 

(4) 303. E. Steinitz, Journ. f. d. reine u. angew. Math. 137, 167 (1910). 

(5) 307. Our proof of this theorem follows E. Noether, Math. Zeits. 

30, 641 (1929). 

(6) 313. In the older investigations T. Molien (Math. Ann. 41 and 
42, 1893) and G. Frobenius operate in the field of all complex 
numbers. The extension to arbitrary fields is due to J. H. M. 
Wedderburn, and is also valid for algebras which are not com- 
pletely reducible — a branch of the subject into which we have not 
entered : J. H. M. Wedderburn, Proc. Lond. Math. Soc. (2) 6, 99 
(1907) ; Bull. Am. Math. Soc. 31, 11 (1925). See also the book by 
Dickson referred to in III ( fl ). Our proof follows E. Noether, l.c. 
( 6 ). See further E. Artin, Abh. Math. Semin. Hamburg, 5, 251 
(1927) ; G. Kothe, Math. Zeits. 32, 161 (1930). 

(7) 320. E. Wigner, Zeits. f. Phys. 40, 4,92 and 883 (1926-27). W. 

Heitler, Zeits. f. Phys. 46, 49 (1927). Only the simplest case, 
that in which the unperturbed term of V consists of / different, 
non-degenerate terms of the individual J, is considered in detail 
in these papers. 

(8) 328. This direct derivation follows H. Weyl, Math. Zeits. 23, 

271 (1925). 

(9) 338. See G. Frobenius, Sitzungsber. Preuss. Akad. 1898, 501. 

(10) 340. yv. Heitler and F. London, Zeits. f. Phys. 44, 455 (1927). 
W. Heitler, Zeits. f. Phys. 46, 47 (1927) ; F. London, in the same 
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volume, 455. W. Heitler, Gott. Nachr. 1927, 368 ; Zeits. f. Phys. 
47 , 835 (1928). F. London, Zeits. f. Phys. 50 , 24 (1928). W. 
Heitler, Zeits. f. Phys. 51 , 805 (1928). M. Delbruck, Zeits. f. 
Phys. 51 , 181 (1928). F. London, in: Quantentheorie und 
Chemie, Leipziger Vortrage 1928, 59 (Leipsic 1928) ; Zeits. f. 
Phys. 63 , 245 (1930). M. Born, Zeits. f. Phys. 64 , 729 (1930). 
J. C. Slater, Phys. Rev. 37 , 481 ; 38 , 1109 (1931). L. Pauling, 
Joum. Ann. Chem. Soc. 53 , 1367 (1931). 

(12) 342 . The calculation is carried through in the first paper by 
Heitler and London cited in ( 10 ). Further see : Y. Sugiura, 
Zeits. f. Phys. 45 , 484 (1927). S. C. Wang, Phys. Rev. 31 , 5l9 
(1928) ; 28 , 663 (1927). E. C. Kemble and C. Zener, Phys. Rev. 
33 , 512 (1929). P. M. Morse and E. C. G. Stuckelberg, Phys. 
Rev. 33 , 932 (1929). 

(13) 346. Zeits. f. Phys. 50, 24 (1928). 

(14) 347 . Zeits. f. Phys. 49 , 619 (1928) ; SoMMERFELD-Festschrift : 
Probleme der modemen Physik (Leipsic 1929). 

(15) 357 . P. A. M. Dirac, l.c. ( a ). For a detailed term calculation 
following this scheme and examples see papers by Slater, Condon, 
Condon-Shortley, Born-Rumer cited in (4) above. 

(16) 358 . The introduction of the symmetry operators c into the 
theory of invariants is due to A. Young, l.c. C 1 ). But he proved the 
irreducibility of neither f) c nor £ c ; that of the first was proved by 
G. Frobenjus, Sitzungsber. Preuss. Akad. 1903, 328, and that of 
the latter by E. Cartan, Bull. Soc. Math. d. France, 41 , 53 (1913) 
and H. Weyl, l.c. ( 8 ). The symmetry classes were re-discovered in 
quantum mechanics by F. Hund, Zeits. f. Phys. 43 , 788 (1927). 

(17) 362. The development from theorem (14.2) to (14.8) follows a 
train of thought communicated to the author in a letter from 
J. v. Neumann. 

(18) 370 . See F. Hund, l.c. ( 1# ) ; J. v. Neumann and E. Wigner, Zeits. 
f. Phys. 47 , 203 ; 49 , 73 (1928). 

(19) 370. F. London, Zeits. f. Phys. 46 , 455 (1928). 

(20) 372. W. Heitler, Zeits. f. Phys. 51, 805 (1928). 

(21) 378 . Follows H. Weyl, l.c. ( 8 ). In the same way the character- 
istics of the rotation group in ^-dimensional space, the " complex 
group ” and all semi-simple groups can be calculated : Math. 
Zeits. 24 , 328, 377 and 789 (1926). 

(22) 382. L.c. ( 8 ). On removing the unitary restriction, the proof 
that we here obtain all irreducible representations of t n requires 
the use of the infinitesimal elements of the group. The knowledge 
won for u n has been carried over to c n under the broadest assump- 
tions by J. v. Neumann, Sitzungsber. der Preuss. Akad. 1927, 26 ; 
Math. Zeits. 30, 3 (1929) ; and I. Schur, Sitzungsber. Preuss. Akad 
1928, 100. 

(23) 383 . Sitzungsber. Preuss. Akad. 1900, 516. 


OPERATIONAL SYMBOLS 


The mirnber refers to the page on which the symbol is defined 

with ... is associated ... 110, 114. 

-3 is contained in 290. 
x conjugate complex of x 15. 

* transposition : for operators 13, symmetry quantities 352, 
symmetry patterns 361. 

- Hermitian conjugate : for operators 17, elements of an 
algebra 167. 

^ a(s) = a(s~' 1 ) 296. 

contragredient matrix 123, representation 123. 

£2. equivalent as correspondences of the ray field 21. 

~ transforms as 145. 

( ) scalar product 16, 32. 

[ ] vector product (in 3-dimensional space) 27 ; commutator 
[HA] = -{HA - AH) 264. 

< ) temporal mean value 88. 

X for vectors 90, vector spaces 90, correspondences 91, 
representations 126, groups 127, algebras and their 
elements 333. 

X multiplication of representations of two groups 127. 

+ addition of representations 113. 

| transition from p to 287. 

jf transition from 5)3 to p 290. 


409 



LETTERS HAYING A FIXED SIGNIFICANCE 


The number refers to the page on which the quantity is defined 

LATIN 

c velocity of light ; a Young symmetry operator 359. 
e primitive idempotent element (generating unit 291) ; — e- 
charge of the electron. 
e(x) = e*®. 

(E„, E v , E t ) = @ electric field strength 99. 

E t energy level 44. 

/ number of electrons, order of a tensor 139, 281. 

/„ 4-vector potential multiplied by ejch 214. 

f- c»rlo J y.(=^-^)21«. 

F action of the electro-magnetic field 215. 

F (ij, i 2) . . if) tensor 139, 281. 

g dimensionality of a group representation 120, Land6 g - 
factor 204, 207. 

h Planck’s quantum of action divided by 51, order of a 
finite group 118. 

H energy 51. 

(H x , H y , H z ) = Ip magnetic field strength 99. 

I signature 188. 

j ) y inner quantum number 189, 190. 

total energy-momentum vector 220. 
k auxiliary quantum number 228. 

Z, L azimuthal quantum number 64, 185, 194 — for s , p , d , /, g } 

, . . terms Z = 0, 1, 2, 3, 4, . . . k 
(L x , L V} L x ) = £ orbital moment of momentum 63. 
m magnetic quantum number 64, 193, multiplicity of a re- 
presentation 321, 350 ; (= /x) mass of the electron. 
m 0 = mcjh, 

M, M f action of the material field 211. 

[M m M V) M m ) = 2ft total moment of momentum 179, 187. 
n dimensionality of a vector space 1; principal quantum 
number 69. 
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LETTERS HAVING A FIXED SIGNIFICANCE 411 


LATIN 

p, q canonically conjugate variables 94, a permutation in the 
rows, columns of a symmetry pattern 359, 

(Px, Pv, Pz) — P linear momentum of a particle 51. 

P symmetry pattern 358. 

(?*> izj = 9 electric dipole moment 83. 
r distance from centre. 

s element of a group : spin quantum number 206. 


(s Xf S V) 

Sz) 

= 3 electric 

current 

4-vector 214. 



(S Xi Sy, S, 

,) = <5 spin 178, 

203. 


1 

0 

0 

i 


So = 





,s t 


0 

1 

1 

0 



0 


, 5 ; 


148. 


T interchange of ifr l9 and \jj x ) ifj 2 r 149. 
v valence 369. 

W perturbation energy 86, total action 216. 
x 0 x x x 2 x z or t x y ^co-ordinates of space time (t = x 0 98, or ct = x 0 
211 ). 


german. (For 3-dimensional vectors see their components 
under Latin letters.) 

C = c n group of (unimodular) linear transformations in n dimen- 
sions 128. 

(c)f representation of c whose substratum is the tensors of 
order / 125. 

=5 = 2 j) representation of z;th degree of C 2 or xi 2 ^ b 3 

128, 142. 

b n orthogonal group in n dimensions 142 ; b’ n same but in- 
cluding improper rotations 143. 

35 (w) 1-dimensional representation of rotation group b 2 141. 
e 2 > . . e n co-ordinate system in vector space 2. 

6 unitary representation of the rotation group induced in 
the function space of ip(x y z ) 143. 

g abstract group 114. 

f a conjugation 118. 

m mean value 158. 

91 representation of the rotation group induced in system 
space 187. 

J), 5(5 invariant sub-space of r, ffi respectively 287, 282. 

n 

t an algebra considered as a vector space 286, t 0 = t == a W 
290, 350. 
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GERMAN 

JR vector space |, 9V corresponding space of tensors of order /, 
[{Rfj space of the symmetric tensors, {SR/} space of the 
anti-symmetric tensors, 239, 242. 

5R ( , $R„ system space of electron translation, spin 196. 
t B left-translation 116. 

U = U„ (unimodular) unitary group in n dimensions 139. 

SB ray representation giving rise to algebra of complex 
quaternions 182. 

% vector in n dimensional vector space 1. 

GREEK 

a = e*lck fine structure constant 216. 

8 {k Kronecker symbol = 1 or 0 according as i — k or i 4 s k 17. 

4* t 

8(jc) Dirac 8-function (== 0 except for x — 0 and f 8(x)dx ~ 1) 
255. 

8, = ± 1 according as S’ is an even or an odd permutation 121. 
8 signature 201. 

<j* S 1 

& — Laplace’s operator a 52. 

'“^•5 212 ' 

e generating clement of a right- and left-invariant sub-space 
311. 

S, <f> polar co-ordinates 60. 

/*(= m) mass of the electron. 
v frequency 50. 

o = Larmor factor — unit of Zeeman separation. 

w =*= ir f symmetric group of permutations of /objects 121. 
p electric charge density 218, an algebra 304. 

<f> a electro-magnetic 4- vector potential 98. 

iff vector defining the state of the material field 49. 

X, X group characteristics, 150, 151. 

(v angle of rotation 151. 
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The numbers refer to pages of the text, 
concepts introduced ir, 

Abelian group 1 X 8 , its unitary irreduc- 
ible representations 140,^ in ray space 
182, quantum kinematics as A. g. 
of rotations 272 ff. A. system of 
forms 25. 

Absorption of photon 44 , quantum 
theory of a. 107, 224, 261, a. lines 45. 

Action of material field 21 1, of electro- 
magnetic field 215, total 216, 222. 

Adaptation of co-ordinate system to 
sub-space 3. 

Addition of vectors 1, of correspond- 
ences 6, of matrices 7, of repre- 
sentations 126 , of elements of an 
algebra 165, 303, of numbers of a 
field 302, direct sum of algebras 311 . 

Affine correspondence 5 , see Corre- 
spondence, linear ; a. geometry 1 ff., 
1 12. 

Algebra, general concept 303 , of group 
166 , 1 81, 286, simple 31 1, 313, 
semi-simple 316, order of a. 304 , 
modulus or principal unit 168, 304, 
basal units 168, 304, division a. 
(= field) 304 , 316, central of a. 167 , 
3 1 1, invariant sub-a. 167 , 289, 

generating unit of s.-a. 168, 291 , 
direct sum 311 , direct product 333, 
reduction into simple matric a. 167, 
309 ff., 315 ; — representation of a. 166 , 
304 ff., regular representation 289 , 
complete reduction of representation 
306; — a. of complex quaternions 182, 
of linear transformations 307, of 
symmetric transformations 282, 332 , 
its enveloping a. 284, reduction of 
a. of linear transformations 30 7 ff. 

Alkali spectrum 85, 86, 202, doublets 
in 204, with anomalous Zeeman 
effect 205. 

Alkaline earth spectrum 207, 246. 

Alternation 358. 


those in boldface to the pages where the 

boldface are defined 

Alternation law 20 7, 370. 

Atom, Rutherford’s model xiii, Bohr’s 
theory of a. 43, radiation on classical 
and Bohr theories 44, on quantum 
theory 104 ff., 256 ff., Hund’s vector 
model of a. 191, 244 ; see Spectrum. 

Automorphism 115 , automorphic corre- 
spondence of group 134 , 

Auxiliary quantum number, see under 
Quantum number. 

Azimuthal quantum number, see under 
Quantum number. 

Balmer 45. 

Bessel’s inequality 33, for system of 
representations 169. 

Black body radiation 41, 104, 256. 

Bohr, H. 39. 

Bohr magneton 66, 205. 

Bohr, N. xiii, 43 > 95 . *° 5 , 236, 245. 

Boltzmann 108. 

Born 48, 74. 

Bose 50. 

Bounded Hermitian form 39. 

Brackett 46. 

Branching rule, for spectra 207, for 
linear and permutation groups 390 ff. 

de Broglie, L. 48, 53, 21 1, 220. 

Burnside’s theorem 153, 

Canonical variable 52, 94, c. trans- 
formation 96 . in quantum mechanics 
98, c. aggregate 79, c. basis for 
rotations in ray space 274. 

Central, of group 118 , of algebra 167 , 
313 . 
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Character, group or group character- 
istic 150 , 327, 395, of unitary re- 
presentation 156, primitive c. 150, 
150, behaviour on addition and 
multiplication 15 1, orthogonality pro- 
perties 156, 159 ff., 3 ^ 7 - For char- 
acters of special groups see utider 
qualifying adjective. 

Character of element of algebra 295. 

Characteristic number of Hermitian form 
or operator 21, 35, of unitary form 26, 
multiplicity of c. n. 22, 26, of energy 
56, 80 ; — characteristic vector or func- 
tion 21, 35, of wave equation 56, 80 ; 

— c. space 22, of energy 80, 192, of 
moment of momentum 189, 192. 

Class of conjugate elements 118, in 
symmetric permutation group 328 ; 

— c. function 150, 156, as element in 
central of group algebra 169. 

Classical mechanics compared with 
quantum mechanics xiii, 73 > 8 1 , 94, 
190, “ c.” combination principle 47, 
82. ! 

Clebsch-Gordan series 128, 163, 190, 
371, as quantum rule for composi- 
tion of moment of momentum 190, 
as valence rule 371. 

Closed shell 86, 245. 

Cogredient transformation 5. 

Collision phenomena 46, 70 ff. 

Combination principle, Ritz-Rydberg 
44, 48, 82, “ classical ” 47, 82. 

Commutation rules, Heisenberg’s 94, 
274, interpretation of 275, wave 
equation derived from c. r. 277 ff., 
c. r. for infinitesimal rotations 178, 
for moment of momentum 179, for 
spin 227, in second quantization 249, 
for Maxwell-Dirac equations 254 ff. 

Commutative field 302, c. group 118, 
c. operators transformed simultane- 
ously to principal axes 25. 

Commutator 177, 264, 267. 

Commutator form 273. 

Completeness of unitary-orthogonal sys- 
tem of functions 3 , of spherical 
harmonics 62, on group manifold 
170, c. of system or unitary repre- 
sentations 140, 159, 170, 305, 318, 
of product representation 164; — com- 
plete system of orthogonal vectors in 
3-space 257. 

Complete reduction of correspondences 
or representation 9 , 122 , sometimes 
equivalent to reduction 18, 123, 136, 


292, 301, 306, 308, of product re- 
presentation 140, of (Sf X ($p 128, 
190, uniqueness 136, 156, c. r. of 
system space with respect to energy 
80, of representation induced in 
system space by t s 188, of group 
space 294, of tensor space 301, of 
an algebra into simple matric algebras 
167, 309 ff., 315 - 

Composition of physical systems 91, 
behaviour of energy on c. 92, 193, of 
moment of momentum 190, c. of 
equivalent individuals 239, 24 1 , under 
Pauli exclusion principle 244, method 
of c. compared with second quantiza- 
tion 248; — c. of transformations 6, 

1 10, see Multiplication. 

Composition series, of sub-groups 132 , 
of sub-spaces 122, 135. 

Compton effect 224. 

Condon 74. 

Congruent modulo sub-space 4. 

Conjugate of element of group 118 , 
for permutation group 328, of ele- 
ment of algebra 107. 

Conjugation 118. 

Conservation law, for electricity 214 ff., 
energy 82, 218, 220, momentum 218, 

220, moment of momentum 188, 

221, Dirac’s c. 1 . 227, of quantum 
field 264 ff. 

Contact transformations 96 . 

Contragredient transformation 12 , re- 
presentation 123 . 

Contravariant vector 13. 

Convex region 79. 

Co-ordinate system, in vector space 2 , 
adapted to sub-space 3, transforma- 
tion of c. s. 4, normal c. s. 16 , 
21, Heisenberg’s c. s. 80 , in special 
relativity 147, in general relativity 
219. 

Correspondence or transformation, 
general 110, identical 110, inverse 
hi, product in, isomorphic 112, 
automoiphic 134 , similarity 283; — 
linear 5 ff., 21, = projection 282, 
in function space 35, trace 11, 150, 
dual 13 , 123, contragredient 12 , 

Hermitian 18 , unitary 16 , infinites- 
imal unitary 28 ff., rotation of ray 
space 20, X -multiplication 90, re- 
duction and complete reduction 9, 
irreducible system of 1. c. 122, 153 ff., 
symmetric c. in tensor space 282 * 
For special groups of correspondences 
see under qualifying adjective. 
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Correspondence principle 95. 

Coupling, Russell-Saunders or (si) 206, 

(;>') 206. 

Courant 40. 

Covariant linear quantity 178 , in 
quantum mechanics i97;~ c. vector 

13- 

Cycle of a permutation 328. 

Cyclic group 1 * 7 - 
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tion of e. c. 214, 217, e. dipole moment 
83, 104, 197. 

Electro-magnetic field, effect on charged 
particle 98, 213, 222, interaction with 
matter 105, 261, equations of 102, 
218, quantization 104, 253, action 
215. 

Electron, de Broglie’s equation for e. 

53 . 


Davisson 50 , 53, 7°* 

Decomposition, see Complete reduction, 
of space 3 , * 22 , of dual s P ace I4 ’ 
in unitary geometry 18, into char- 
acteristic spaces, 22. 

Degenerate system 83, perturbation 
of 86, . accidental degeneracy 192. 


of a 


Degree of a representation 120 
8-function 36, 255. 

Derivative of operator 94. 

Dimensionality of space 2 
representation 120. 

Dirac 109, 210, 21 1, 217, 225, 255? 2 ^°> 
262, 357- 

Dirac’s relativistically invariant equa- 
tions for electron 213, 218, 225, in 
central field 227 ff., quantization of 
2S3 ff, ; — D. theory of proton 262, 

Directional quantization 67, 75, 205. 
Dispersion S 3 , 224. 

Division algebra (= field) 304 , 316. 

Double tensor 347* 

Dual space 12 , matrix 13 , system of 
transformations 123, symmetry eie- 
ment and representation 35 2 , sym- 
metry pattern, 361, 369. 

Dynamical variable, represented by 
Hermitian form 74 , 275 . measure- 
ment of 74 ff-, mean value or ex- 
pectation 75, intensity on transition 
83 , 197, composition 91, totality 01 
d v. represented by irreducible system 

23 8; — d. law 54, 80 ff., 97, 187, 266. 
Dynamically independent systems 92. 

. Effective quantum number, see under 
Quantum number. 

Einstein 42, 5 °- 
Electric charge, atomicity of 216, posi- 
tive and negative 262,. e. c. density 
and current density 215, conserva- 


213, e. beams 50, spin 195 , 196 , 
203, 276, translation 196 , in spher- 
ically symmetric field 63, 227, nega- 
tive energy levels and “ positive e. 
225, existence vs. constitution of e. 
261, e. and proton 262. 

Element, of group 114 , of group alge- 
bra 166 , of algebra 303 , 
potent e. 168, 291 , independent 29 Z, 
primitive 293 , real 295, trace 299 , 
317 , scalar product 299 , character 
of an e. 295. 

Elsasser 74 . 


Emission, of photon 44, quantum 
theory of e. and absorption 107, 224, 
261, spontaneous 107, stimulated 
108. 

Energy, and its operator 5 1 80 ff., 

97, 187, 215, e. level 44 , 5 °, in 
collision phenomena 70, in perturba- 
tion theory 86 ff., on composition 9 2 , 
in electro-magnetic field 101, with 
spin 215, 220, e. of radiation field 
103, 258, e. of simple state 189, ^i, 
of svstem of equivalent individuals 
320 ff., 356, of molecule 346, ex- 
change e. 322, 342, 346 , e. and 
momentum 51, 218, 220, conserva- 
tion 188, zero-point e. 104, 258, 201, 
inertia of e. 221, e. quantum 41 • 

Enveloping algebra 284, for double ten- 
sors 348. 

Equality, axioms of 112. 

Equivalence degeneracy 239 ff., 3 2 °* 
Equivalent individuals, state of system 
consisting of e. i. 239 ff., energy 241, 
320 ff., 356, quantization 246. 

Equivalent svstems of linear transforma- 
tions 121, e. representations 120 
sub-spaces 135 , 283, e. points with 
respect to transformation 112, e. 
elements with respect to sub-group 
118. 

Euclidean geometry 15, 112. 

Exchange energy 322, 342, 346. 
Expectation or mean value of physical 
quantity 75, 78, 92 - 

Exponential function 28, of matrix, 29. 
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Factor group 119 , 132. 

Faithful realization 114 . 

Ferro- magnetism 347. 

Field equations, for electro-magnetic 
field 102, 218, for matter 213 ff., 
their quantization 104 if., 253 ff. 

Field, number f. 294, 302 , algebraically 
closed 294 , commutative 302 , finite 
f. of modulus p 303 ; — ray f. 20 , 
vector f. 20, point-f. no. 

Fine structure, in hydrogen 203, 236 

f. s. constant 216 . 

Form, linear 12, bi-linear 13, 16, 18, 
Hermitian 18 , unitary 16 , commu- 
tator 273, anti-symmetric bi-linear 
273 , 397 - 

Fourier coefficient 33, series 33, in- 
tegral 39, F. c. or group matrix for 
representation 165. 

Franck 46, 70, 74. 

Frequency 50, Bohr’s f. rule 47, 105, 
109. 

Frobenius 156, 358, 383. 

Function space 32, of quadratically 
integrable functions 143. 


118 sub-g. 116 , indcx of sub-g. 
118 , self-conjugate or invariant sub-g. 

119 132. factor g. 119 , simple 182 , 

direct product 127 , closed continuous 
160 ff., Lie theory of continuous g. 
175 g. maiufolu ioon., invariant 
sub-space of g. manifold 291 ; -—realiz- 
ation of g. 114 , representation of g 
120 , of sub-g. 127, , 334 > of direct 
product 333, p- algebra 

of g 186 , lol, 286. For special 
groups, see under qualifying adjectives. 


Gurney 74* 

Gyro-magnetic effect 205. 


Hallwachs 42. 

Hamilton 50, 138. 

Hamiltonian equations, in classical 
mechanics 96, 98, in quantum mech- 
anics 94, in quantum held theory 253. 

Heisenberg xiii, 48, 80, 82, 222, 264, 
347 - 

Heisenberg’s co-ordinate system 80 . 

Heisenberg- Pauli theory of the quantum 
field 253 ff. 

Heitler 342. 


Galois, 132. 

F-process 126. 

Gamow 74. 

Gauge invariance 100 , 213, 220, rela- 
tion to conservation of electricity 214, 
217, r 61 e in quantization 256, 271. 

Generating function of infinitesimal 
canonical transformation 97. 

Generating unit 291 , independent 292 , 
in field of complex numbers 295, of 
symmetry class of tensors 296. 

Geometry, affine or vector 1 ff., 112, 
Euclidean 15, 112. unitary 15 ff., 
characterized by group 112. 

Gerlach 65, 75. 

Germer 50, 53, 70. 

^-factor, Land 6, 204, 205, 207. 

Goudsmit 203. 

Group noff., transformations g. Ill, 
abstract 114 ff., isomorphic 115 , 
automorphic correspondence of g. 
115, 134 , commutative or Abelian 
118 , cyclic 1 17, order of finite g. 
118 , of element of g. 117, central 


Hellinger 39, 40. 

Hermite 18. 

Hermitian form or operator 18 , non- 
degenerate 18, positive definite 18, 
unit 15, idempotent 23 , in function 
space 35, 37, bounded 39, product 
of H. 1. 20, • trace 20, characteristic 
number 21, 35, transformation to 
principal axes for single II . f. 21 ff., 
32, for Abelian system 25 ; — H. f. 
represents physical quantity 74, 275, 
characteristics statistical aggregate 
79 » 239; — H. conjugate 17 . 

Hermitian polynomials 57 ff. 

Hertz, G. 46, 70, 74. 

Hertz, H. 42. 

Hilbert 39. 

Hilbert space 32. 

Hund’s vector model of the atom 191, 
244. 

Hydrogen atom 45, on Schr&dinger's 
theory 63 ff., on Dirac's theory 234 ff., 
spectrum 45, 69, fine structure 203, 
236. 
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Idempotent Hermitian form 23, 37, 
independent 23 ; — i. element of an 
algebra 168, 291 , independent 292 , 
primitive 293 . 

Identity correspondence 6, 110 , repre- 
sentation 1 21. 

Independent, linearly i. vectors 2, i. 
idempotent forms 23, idempotent 
elements of algebra 292 . 

Index of sub-group 118 . 

Infinitesimal unitary transformation 28 ff., 
rotation 27 ff., moment of momentum 
induced by i. r. 178, canonical trans- 
formation 96, element of continuous 
group 160, 177. 

Inner quantum number, see under 
Quantum number. 

Intensity, as measure of probability 49, 
i. of dynamical variable on transition 
83 , 197, of spectral lines 44, 83, 232, 
in anomalous Zeeman effect 201. 

Interaction between matter and radia- 
tion 104 ff., 261. 

Interchange, of right and left 225, of 
past and future 109, 227, 263. 

Invariance, in special relativity, dif- 
ficulty for quantum mechanics 54, 
Dirac’s treatment 210 ff., i. of 
quantum field equations 268 ff. ; — in 
sense of general relativity 219, under 
change of gauge 100 , see Gauge 
invariance. 

Invariant of transformation group 117, 
170 , in representation space 17 1, 
classical theory 170 ff. 

Invariant sub-space 8, under system 
of transformations 122, 135, 282, 

left-i. s.-s. in group space 289 ff., left- 
and right-i. s.-s. 168, 31 1, in tensor 
space 296 ff, significance in quantum 
theory 320 ; — i. sub-group 119 , 
maximal 132. 

Inverse correspondence 6, 111 , element 
of group 1 14. 

Involution 13. 

Ionization potential 46. 

Irreducible invariant sub-space 122 , 282, 
system of linear transformations, re- 
presentation 122, reduction into i. 
constituents 122, 135 ; — irreducibility 
= complete irreducibility in unitary 
domain 136, 292, 301, for reducible 
algebra 305, for algebra of trans- 
formations in completely reducible 
vector space, 307. 


Isomorphic correspondences 1 1 2 
simply isomorphic groups 115 . 

Jeans 42, 102, 103. 

(jj) coupling 206. 

Jordan-Holder theorem 131 ff. 

Jordan, P. 261, 280. 

Kinematically independent systems 92, 
190, perturbation of 93. 

Kinematics of system determines repre- 
sentation in- system space 189, 
Heisenberg’s quantum k. 94 ff, as 
Abelian group of rotations 272 ff., 
in second quantization 250, k. of 
spin 195, 203, 276. 

Klein’s Erlanger programme xv, 112, 

Laguerre polynomials 70. 

Land£, 204, 208. 

Laporte’s rule 201, 203. 

Legendre polynomials and associated 
functions 62, with spin 230. 

Lenar d 42. 

Leonardo da Vinci 112. 

Lie 176. 

Light, wave and corpuscular nature of 
48 ff., 53. 

Linear, 1 . algebra 303 , see Algebra ; — 
1. correspondence 5 , see under Corre- 
spondence; — 1. form 12, 1. covariant 
quantity 173 , 1 . projection = 1 . cor- 
respondence 282, 1 . sub-space 2; — 
1. momentum, see Momentum, linear. 

Linear group, complete c n 123, simplest 
representations 123 ff, representa- 
tion (£y of c 2 128 ff., its ir- 
reducibility 299, representation (&f, g 
13 1, 164 ; — reduction of (c) f equivalent 
to reduction of algebra of symmetric 
transformations 284 ff., unitary re- 
striction immaterial 285, result of 
the reduction 301, characteristics 
335 ff,, relation to characters of 
symmetric permutation group 326- 
representations of order / 309, 

branching law 391. 

London 342, 346, 370. 

Lorentz group, restricted, obtained from 
c 2 147 ff., complete L. g. obtained on 
adding reflection 147, positive and 
negative transformations 147, and 
Dirac’s equations 212 ff., transforma- 
tion induced in system space 268 ff. 

Lyman 45. 
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Magnetic quantum number, see under 
Quantum number. 

Magneto-mechanical anomaly 205. 

Magneton, Bohr 66, 205. 

Magnitude, absolute, of vector 16, 19. 

Mapping no, see Correspondence, 
Transformation. 

Matric algebra,» simple 168, 313. 

Matrix 7 , dual or transposed 13 , unit 
6, addition 7, multiplication 8, re- 
duced and completely reduced’ 9, 
transformation of m. 8, norm 11, 
trace 11 ; — group m. 165 . 

Maxwell's equations 102, 218, quan- 
tization of 104 ff., 253, M. action 215. 

Mean value or expectation of physical 
quantity in pure state 75, 78, 92, in 
mixed case 79; — m. v. over group 
manifold 158. 

Measurement of dynamical variable 74 ff 

Metric 15. 

Millikan 42, 245. 

Minkowski, H. 79. 

Mixed state 79. 

Modules, of algebra 168, 304, reduc- 
tion of 168, 301 ; — of finite field 303 , 

Molecule, spectrum 19 1, perturbation 
theory ana constitution 339 ff M non- 
polar bond 342, London formula 
for binding energy 346, on taking 
account of Coulomb forces 356, val- 
ence theory 369 ff. 

Moment of momentum of a representa- 
tion 179 , of D* 179 ; — m. of m. of phy- 
sical system 187 , orbital 64, 195 , 
spin 195 , 203, 218, behaviour on 
composition 190, conservation 188, 
219 IF., 227, reduction of system 
space with respect to m. of m. 192, 
induced by infinitesimal rotations of 
Lorentz transformations 185, 269. 

Momentum, linear, and its operator 51, 
220, conservation of energy and m. 
218, 264 ff. 

Moseley’s law 69. 

Motions, geometrical 1 1 1 , group of 1 76. 

Multiplet 196, 206 , 373, as relativis- 
tic phenomenon 204, 234, normal 
Zeeman effect 101, 193, 198, anom- 
alous Zeeman effect 204,. 208 ff., 
alkali doublets 204, singlets and 
triplets in alkaline earths 207, 246, 


multiplicity 321, 350, under Pauli 
exclusion principle 352, in 2-dimen- 
sional spin 355, 369, multiplicity and 
valence 369 ff., branching rule and 
alternation law 207, 370. 

Multiplication, of vector by number 1, 
of correspondences and matrices 6 ff., 
of numbers of field 302, of elements 
of algebra 165, 303, quaternion m. 
138, outer or X -m. of spaces, vectors, 
operators 90 , 125, of representations 
126 , direct product of groups 127 , 
333, of algebras 333, X -m. of repre- 
sentations 127 ; — scalar m. of vectors 
16 , of elements of an algebra 299 , 317. 

v. Neumann 40, 78. 

Noether, E. 134. 

Normal co-ordinate system 16 , in rel- 
ativity 147, n. state of atom 45, 
n. term order 206. 

Number, of field 302 , operations on 302 ; 

— characteristics n. 21. 

Operator = linear correspondence 6, 
Hermitian. 18 , in function space 35, 
representing dynamical variable 55, 
considered as function of time 81, 
derivative of 0. 94. 

Orbit, in older quantum theory 47, 
orbital moment of momentum 64, 
195 . 

Order, of finitje group 118 , of element 
of group 1 17, of sub-group 118, 
of finite algebra 303 . 

Orthogonal group, see Rotation group ; 

— o. transformation 16, o. vectors 16. 

Orthogonality relations 32, for group 
characters 159 ff„ 317, for sym- 
metric permutation group 367. 

Oscillator 43, .56 ff., 84, black body 
radiation as system of o. 102 ff., 258, 
quantum mechanical laws of system 
of o. 249. 

Parseval’s equation 33, 35, 162. 

Paschen 45, 236. 

Paschen-Back effect 208. 

Pattern, symmetry, see Symmetry 
pattern. 

Pauli 77, 203, 21 1, 244, 264, 347, 351. 

Pauli exclusion principle 207, 244 ff., 
and reduction of algebra of sym- 
metric transformations 281, 323, 347 ff., 
355 , 370 ff 
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Peirce reduction 312. 

Periodic system of the elements 69, 
242 ff. 

Permutation 11, reduction into cycles 
328, conjugate 328, as operator on 
tensor 281. 

Permutation group, symmetric 121 , 
classes 328, elements as symmetry 
operators 286 , relation to symmetry 
class of tensors 286 ff., for arbitrary 
p. g- 33 2 > characters 320, 3S3 if., 
relation to characteristics of unitary 
group 331, use of characters to 
calculate exchange energies 322 ff., 
energy of non-polar bond 346, ex- 
plicit theory of representations 358 ff., 
reciprocity theorems 339, branching 
law 390. 

Perturbation theory 86 ff., for kine- 
matically independent systems 93, 
for equivalent individuals 321 ff., 
for molecules 339 ff. ; — p. energy 86, 
for axially symmetric field 192, for 
magnetic field 101, 193, 204, 224, 
for electric field 101, 224, spin p. 196, 
in Dirac theory 224, determines 
transition probability 89. 

Pfund 46. 

Photo-electric effect 42. 

Photon 42, 49, 54, 104, 248, 258, 261. 

Planck xiii, 41. 

Planck’s radiation law 41, 108. 

Point-field no. 

Polynomial, characteristic 1 1, 22 ; — Her- 
mitian 57 ff., Legendre 62, with 
spin 230, Laguerre 70. 

Primitive unit 293 , character 150, 
symmetry class 358. 

Principal unit of algebra 168, 304; 

— p. transformation 128, transforma- 
tion of Hermitian forms to p. axes 21, 
25, 3 2 > 39> for unitary forms 26, 39; 

— p. quantum number, see under 
Quantum number. 

Probability, relation to intensity 49, 
that a dynamical variable assume a 
given value in a pure state 75, in a 
mixed state 79, p. density and current 
density 50, 215, 217 ; — transition p. 73, 
83, 89, in composite system 90, 93, 
for an atom in radiation field 106 ff. 

Product, see Multiplication. 

Projection, with respect to sub-space 4 , 
in unitary geometry 18, orthogonal 


and unitary-orthogonal 23, linear 
p. — linear correspondence 282. 

Proton, Dirac’s theory of 262. 

Pure state 75, conditions for 77 . 

Quantization, in the older quantum 
theory 47, in Schrodinger’s theory 
51, 56, in Heisenberg’s 93 ff., of 
composite system 89, of electro- 
magnetic field 104, 253, second 246, 
of Maxwell- Dirac field equations 
2 53 ff- ; — directional or space q. 67, 75, 
205. 

Quantum, of action 41,51, of energy 4 1 . 

Quantum kinematics, Heisenberg’s 94 ff., 
as Abelian group of rotations 272 ff., 
in second quantization 250. 

Quantum mechanics, general scheme 
74 ff., dynamical law 54, 80, 97, 187, 
266, composition 91, Heisenberg’s 
formulation 93, Schrodinger’s equa- 
tion 54, 10 1, Dirac’s equations 21 3, 
218, Heisenberg- Pauli q. m. of wave 
fields 253 ff. 

Quantum number, auxiliary (k) 228 , 
selection rules 233, relation to azi- 
muthal and inner q. n. 228, 233 ; — 
azimuthal q. n. (/, L) 64 ff., 142*, 196, 
determines orbital moment of mo- 
mentum 65, 196, selection rules 84, 
201, on composition 194, 207, 373, 
relation to auxiliary q. n. 228, 233 ; 

— inner q. n. (j,J) 189 , 196, deter- 

mines total moment of momentum 
179, 189, behaviour on composi- 

tion 190, 194, 206, selection rules 
198, relation to auxiliary q. n. 228, 233 ; 

— magnetic (m) 64, 193 , determines 
z - component of moment of momentum 
65, 180, 189, selection rules 85, 198, 
of spin and of orbital moment of 
momentum 209, in Dirac’s theory 
232 ; — principal or total ( n ) in hydro- 

en 69 , in hydrogen-like spectra 85, 
as no group-theoretic significance 
144, true 86, 243, effective 243; — 
radial 64, 144; — spin (.r) 206, re- 
lation to valence 369. 

Quantum state 43, 56, 80 , 188, simple 
189 . 

Quaternion 138, complex 182. 

Radial quantum number, see under 
Quantum number. 

Radiation, from atom 44, 83 ff., 105 ff., 
224, field 102 ff., 215, 256 ff., black 
body 41, 104. 
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Ray 4 , 20, represents state of physical 
system 75, r. field 20, rotations of 
r. field 273, r. representation 181 ff. 

Rayleigh 42. 

Real element of algebra 167, gener- 
ating unit 295. 

Realization of group 114 , faithful 114 , 
contracted 118, 119, of algebra 166 ; 
— linearr. — representation 120, see 
Representation. 

Reciprocity theorem, for arbitrary group 
338, for permutation group 339. 

Reduction of correspondences or re- 
presentation 9, 122, uniqueness 136, 
156, complete r. 9 , 122, 135 (see 
Complete reduction), sometimes im- 
plies complete r. 18, 123, 136, 292, 301, 
306, 308, of regular representation 
289 ff, 305 ff., of system space of 
equivalent individuals 238 ff., anti- 
symmetric r. for electrons 242, 351 ff., 
symmetric r. for photons 248, 351 ff, 
influence on term spectrum 241, 372 ff , 
general treatment without spin 296 ff., 
with spin 347 ff., for symmetric and 
anti-symmetric cases 351 ff 

Reflection, signature induced by r. 143, 
146, 188. 

Regular representation 289 , reduction 
305 & 

Relativity theory, special 51, 98 ff, 146 ff, 
of quantum mechanics 210 ff., of 
wave fields 268 ff., r. and spin 204, 217, 
222 ff., ; — general 219. 

Representation, of finite group 120 , 
of continuous group 160 ff., by ro- 
tations of ray space 18 1, degree or 
dimensionality 120, character 150 , 
complete reduction 122, irreducible 
122, uniqueness of reduction 136, 
156, criterion for irreducibility 159, 
identical 121, equivalent 121, unit- 
ary 136 ff, any r. equivalent to unitary 
r - 157 ; — formal processes: addition 
126 , X -multiplication 126 , 127, X- 
multiplication 127, /"-process 126, 
r. of sub-group 127 ; — of algebra 166 , 
304 ff., regular 289 ; — general theory : 
orthogonality properties 157 ff., 317, 
in terms of group algebra 165 ff, 
completeness of system of r. 159, 170, 
318, proved by reduction of regular 
r. 305 ff. For r. of special groups, 
see under qualifying adjective. 

Resonance, between states of same energy 
87, between equivalent individuals 
239 ff , 320. 


Resonance line 45. 

Ritz-Rvdberg combination principle 44, 
48, 82. 

Rontgen 43. 

Rotation group, in 2-space and its re- 
presentations 140 ff., orthogonality 
of characters 162; — in 3-space 
and its representations 142 ff., rela- 
tion to unitary group in 2-space 144, 
augmentation by improper rotations 
143, orthogonality of characteristics 
163, completeness 143, 163, 180, 184, 
389, generated by infinitesimal ele- 
ments 175, representation induced in 
system space 185, 195,372; — in 

n -space 184. 

Rotation in ray space 21, 181, 273, 
representation by r. of ray field 180, 
quantum kinematics as Abelian group 
of r. 2 72 ff. 

Rupp 50. 

Russell- Saunders coupling 206. 

Rutherford xiii, 74. 

Rydberg number xiii, 45, 69. 


Scalar product, see Multiplication. 

Scalar quantity, commutes with moment 
of momentum and signature 188, 
selection rules 197. 

Schrodinger 48, 50, 56, 102, 187, 216, 
220, 258. 

Schrodinger’s equation 54 ff., relativ- 
istic jo 1, for system of equivalent 
particles 194, as limiting case of 
Dirac’s 234, derived from com- 
mutation rules 277 ff. 

Schur, I. 152. 

Schwarz’ inequality 30, 393. 

Second quantization 246, see under 
Quantization. 

Secular eq uation II , 2 1 , 26, in quantum 
theory §8, 209, 344. 

Selection rules 44, 84, 85, for oscillator 
84, for electron without spin 84 ff., 
with spin 232, for scalar quantity 197, 
for vector quantity 197, for auxiliary 
quantum number 233, azimuthal 84, 
20.1, inner 198, magnetic 85, 198, 
for signature 201. 

Self-conjugate sub-group 119 , maxima 

132. 

Semi-simple algebra 316. 
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Separation of terms by perturbation 87, 
321, axially symmetric perturbation 
193, in normal Zeeman effect 101, 193, 
198, in anomalous Zeeman effect 204, 
208 ff 

Series, in hydrogen 45 > 6 9 , in alkalies 
85, 202. 

Series of composition, see Composition 
series. 

Signature, of representation 143 , as 
dynamical variable 188 , 203, selec- j 
tion rule 201. 

Simple algebra 31 1, 313. g rou P 132 > 
state 189 . 

(si) coupling 206. 

Smekal-Raman effect 224. 

Sommerfeld 193, 236. 

Space, affine, linear, vector 1 ff., linear 
sub-s. 2, dual 12, unitary 15 ff., 
Hilbert or function 32, 143, reduction 
or decomposition 20, 22, composition 
series 122, 135, product 90 , tensor 
125, 281 ff., group s. 1 15, 160, re- 
presentation 120, 17 1 ff., algebra as 
vector s. 286, 305, system, see System 
space, 

Space quantization 67, 75 ? 20 5 - 
Span, space spanned by vectors 3, 20. 

Spectrum, atomic, line s. reduced to 
term s. 44, of hydrogen and 1 -electron 
ions 45, in Schrodinger’s theory 69, 
in Dirac’s theory 234, of alkalies 85 ff., 
doublets 204, of alkaline earths 207, 
246, 3-electron 374, of elements of 
periodic table 206 ff., 242 ; — general 
theory, without spin 194, with spin 
206 ff,, application of Pauli ex- 
clusion principle 242 ff., group- 
theoretic classification 369 ff., re- 
duction into term classes 283 ff,, 320 ff., 
calculation of term values 320 ff. ; 
— molecular 1.9 1 ; — of characteristic 
numbers 36. 

Spherical harmonics 60 ff, 84, as basis 
of unitary representation in function 
space 142, >vith spin 230 ff. 

Spin, electron 195 , 196 , 203, as relativ- 
istic phenomenon 204, 217, 222 ff., 
s. moment of momentum 195 , 221, 
magnetic effect 204, 224, s. and 
valence 369 ff. ; — s. perturbation 196, 
203, in Dirac’s theory 222 ff. ; — s. 
quantum number, see under Quantum 
number. 

Stark effect, linear 102. 
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State of a physical system, represented 
by vector or ray in system space 54, 

74 ff., pure 75, 78/ mixed 79, of 
total system under-determined 92 ; — 
quantum or stationary 43, 56, 80 , 1 88, 
simple 189 . 

Stationary state, see under State. 

Statistical aggregate 78, 239, canonical 
79- 

Statistics, Bose-Einstein 50. 

Stern-Gerlach effect 65, 75, 205. 

Stieltjes integral 37. 

Stoner’s rule 243. 

Sub-algebra, left-invariant 289, (left- 
and right-) invariant 167, 31 1, 3 X 4* 

Sub-group 116, 334 ff., cyclic 117, 
index 118, self-conjugate or invariant 
119, maximal invariant 132. 

Sub-space 2 , 32, invariant, under single 
transformation 8, under system of 
transformations 122, equivalent or 
similar 135 , 283, see also Invariant 
sub-space. 

Substitution in, see Correspondence. 

Sum, see Addition ; — s. rule for influence 
of magnetic field, 209. 

Superposition principle 49. 

Symmetric permutation group, see Per- 
mutation group, symmetric. 

Symmetric transformation in tensor space 
282 , special 284, Hermitian 283, 
unitary 285, enveloping algebra 284, 
for arbitrary permutation group 332. 

Symmetrization 358. 

Symmetry class of tensors 287 , 296, 
primitive 358, of spectral terms 3 2I > 
multiplicity 321, 350 ff., 367. 

Symmetry operator 286 , Young’s 359 . 

Symmetry pattern 358 ff., dual on trans- 
posed 361, 368, generated by Young 
symmetry operator 359 ff. 

System space for translation 54, 74, 195, 
for spin 195, total 185, 196, 347 
for equivalent individuals 186, 206 ff., 
347 ; — reduction with respect to 
energy 80, moment of momentum 
188, 206, with regard to symmetric 
permutation group 283 ff., 3 20 
with regard to Pauli exclusion prin- 
ciple 242 ff., 281 ff., 347 ff- 
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Tensor 125 ff., 139, 281, symmetry 
class of t. 287, 338, 358, double to 
347 ; — t. space 125, 281 ff., symmetric 
transformation in t space 282, in- 
variant sub-space 296, reduction 301 ; 
— energy-momentum t. 218. 

Term 44, as energy level or character- 
istic number 46, 56, 80, see also under 
Spectrum, Separation ; — t. order, 
normal 206. 

Thomson, G. P. 50. 

Total quantum number, see under 
Quantum number. 

Trace, of matrix or correspondence 11, 
150, of dement of algebra 299, 317. 

Transformation, linear 4 = Correspond- 
ence, linear ; — contragredient 12, unit- 
ary 16, principal 128, symmetric in 
tensor space 282, for arbitrary per- 
mutation group 332, spetial sym- 
metric 284, canonical 96, in 
quantum mechanics 98 ; — t. to principal 
axes 21 ff , 37 ; — t. group 111, for 
sperial groups, see under qualifying 
adjective. 

Transition probability 83, 89, in radia- 
tion held 106 ff 

Translation, left- 116, right- 116. 

Translation, electron 195. ' 

True quantum number, see under 
Quantum number. 


Uhlenbeck 203. 

Uncertainty principle 77, derivation 
393- 

Unimodular linear transformation, group 
128. 

Unit, element of group 1 14, of field 302, 
of algebra (modulus or prindpal unit) 
304, basal 168, 304, idempotent 

generating 168, 291, independent 

292, primitive 293, real 295 ; — u. 
Hermitian form 15. 

Unitary correspondence, transformation, 
matrix 16 ff , characteristic numbers 
26, infinitesimal 28, u. geometry 
15 ff., u. t. as canonical t. of quantum 
mechanics 98, u. representation of 
group 137 ff 


Unitary group, in 2-space 137 ff., its 
unitary representations (S/ 137, com- 
pleteness 137, 163, 389, character- 
istics 1 51, 163, connection with ro- 
tation group bs 144, augmented 146 ; 
— in »-space 139 ff., reduction of (u)/ 
and algebra of symmetric transforma- 
tions 285, characteristics 331, 381, 
completeness 381. 

U nitary-orthogonal system of vectors 
or functions 19, 33, completeness 33, 
on group manifold 158. 

Valence 342, 369, v. electron 86, 243 

Vector, v. space, v. geometry Iff., in 
Hilbert space 31 ff, v. field 20, co- 
variant and contravariant 13, absolute 
magnitude 16, dual 17, scalar pro- 
duct 16, unitary-orthogonal v. or 
system 16, 19, as element of Abelian 
group 134; — 3-v. operator in quantum 
mechanics 197, selection and intensity 
rules 198 ff, complete system of 
orthogonal v. in 3 -space 257, v. 
potential of electro-magnetic field 98. 

Vector model of atom, Hund's 191. 

Velocity, phase and group 53. 

Volume, measure of, on manifold of 
closed continuous group 160, for 
unitary group 386, for unitary uni- 
modular group 162, 389. 

Wave equation, de Broglie's 53, 
Schrodinger’s 54 ff, iot, Dirac's 213, 
218, 225. 

Wave field, Heisenberg- Pauli quantiza- 
tion of 253 ff 

Wave length 53. 

Wedderbum's theorem 313. 

Wentzel 74. 

Wien 41. 

Wigner 280, 320. 

Wintner 39. 

Young, A. 358. 

Young's symmetry operator 359 . 

Zeeman effect, normal 85, iox, 193, 198, 
anomalous 198, 204, 208, 223, for 
doublets 204, for multiplets in gene- 

I ral 208 ff. 




